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Abstract 

In this review we present a thoroughly comprehensive survey of recent work on modi- 
fied theories of gravity and their cosmological consequences. Amongst other things, we 
cover General Relativity, Scalar- Tensor, Einstein- Aether, and Bimetric theories, as well 
as TeVeS, f(R), general higher-order theories, Hofava-Lifschitz gravity, Galileons, Ghost 
Condensates, and models of extra dimensions including Kaluza-Klein, Randall-Sundrum, 
DGP, and higher co-dimension braneworlds. We also review attempts to construct a Pa- 
rameterised Post-Friedmannian formalism, that can be used to constrain deviations from 
General Relativity in cosmology, and that is suitable for comparison with data on the 
largest scales. These subjects have been intensively studied over the past decade, largely 
motivated by rapid progress in the field of observational cosmology that now allows, 
for the first time, precision tests of fundamental physics on the scale of the observable 
Universe. The purpose of this review is to provide a reference tool for researchers and stu- 
dents in cosmology and gravitational physics, as well as a self-contained, comprehensive 
and up-to-date introduction to the subject as a whole. 
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1. Introduction 



The General Theory of Relativity is an astounding accomplishment: Together with 
quantum field theory, it is now widely considered to be one of the two pillars of modern 
physics. The theory itself is couched in the language of differential geometry, and was 
a pioneer for the use of modern mathematics in physical theories, leading the way for 
the gauge theories and string theories that have followed. It is no exaggeration to say 
that General Relativity set a new tone for what a physical theory can be, and has truly 
revolutionised our understanding of the Universe. 

One of the most striking facts about General Relativity is that, after almost an entire 
century, it remains completely unchanged: The field equations that Einstein communica- 
tion to the Prussian Academy of Sciences in November 1915 are still our best description 
of how space-time behaves on macroscopic scales. These are 

_ 8ttG 

where G M „ is the Einstein tensor, T^ v is the energy momentum tensor, G is Newton's 
constant, and c is the speed of light. It is these equations that are thought to govern the 
expansion of the Universe, the behaviour of black holes, the propagation of gravitational 
waves, and the formation of all structures in the Universe from planets and stars all the 
way up to the clusters and super-clusters of galaxies that we are discovering today. It 
is only in the microscopic world of particles and high energies that General Relativity is 
thought to be inadequate. On all other scales it remains the gold standard. 

The great success of General Relativity, however, has not stopped alternatives being 
proposed. Even during the very early days after Einstein's publication of his theory there 
were proposals being made on how to extend it, and incorporate it in a larger, more 
unified theory. Notable examples of this are Eddington's theory of connections, Weyl's 
scale independent theory, and the higher dimensional theories of Kaluza and Klein. To 
some extent, these early papers were known to have been influential on Einstein himself. 
They certainly influenced the physicists who came after him. 

The ideas developed by Eddington during this period were later picked up by Dirac, 
who pointed out the apparent coincidence between the magnitude of Newton's constant 
and the ratio of the mass and scale of the Universe. This relationship between a funda- 
mental constant and the dynamical state of a particular solution led Dirac to conjecture 
that Newton's constant may, in fact, be varying with time. The possibility of a varying 
Newton's constant was picked up again in the 1960s by Brans and Dicke who developed 
the prototypical version of what are now known as scalar-tensor theories of gravity. These 
theories are still the subject of research today, and make up Section 3.1 of our report. 

Building on the work of Hermann Weyl, the Soviet physicist Andrei Sakharov pro- 
posed in 1967 what would prove to be one of the most enduring theories of modified 
gravity. In Sakharov's approach, the Einstein-Hilbert action, from which the Einstein 
field equations can be derived, is simply a first approximation to a much more complicated 
action: Fluctuations in space-time itself lead to higher powers corrections to Einstein's 
theory. In 1977 Kellogg Stelle showed formally that these theories are renormalizable in 
the presence of matter fields at the one loop level. This discovery was followed by a surge 
of interest, that was boosted again later on by the discovery of the potential cosmologi- 
cal consequences of these theories, as found by Starobinsky and others. In Section [4] we 
review this work. 
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The idea of constructing a quantum field theory of gravity started to take a front seat 
in physics research during the 1970s and 80s, with the rise of super-gravity and super- 
string theories. Both of these proposals rely on the introduction of super-symmetry, and 
signalled a resurgence in the ideas of Kaluza and Klein involving higher dimensional 
spaces. Boosted further by the discovery of D-branes as fundamental objects in string 
theories, this avenue of research led to a vastly richer set of structures that one could 
consider, and a plethora of proposals were made for how to modify the effective field 
equations in four dimensions. In Section [5] we review the literature on this subject. 

By the early 1970s, and following the 'golden age' of general relativity that took place 
in the 1960s, there was a wide array of candidate theories of gravity in existence that 
could rival Einstein's. A formalism was needed to deal with this great abundance of pos- 
sibilities, and this was provided in the form of the Parameterised Post-Newtonian (PPN) 
formalism by Kenneth Nordtvedt, Kip Thorne and Clifford Will. The PPN formalism 
was built on the earlier work of Eddington and Dicke, and allowed for the numerous 
theories available at the time to be compared to cutting edge astrophysical observations 
such as lunar laser ranging, radio echo, and, in 1974, the Hulse-Taylor binary pulsar. 
The PPN formalism provided a clear structure within which one could compare and as- 
sess various theories, and has been the benchmark for how theories of gravity should be 
evaluated ever since. We will give an outline of the PPN formalism, and the constraints 
available within it today, in Section [2] 

The limits of General Relativity have again come into focus with the emergence of 
the 'dark universe' scenario. For almost thirty years there has existed evidence that, if 
gravity is governed by Einstein's field equations, there should be a substantial amount of 
'dark matter' in galaxies and clusters. More recently, 'dark energy' has also been found 
to be required in order to explain the apparent accelerating expansion of the Universe. 
Indeed, if General Relativity is correct, it now seems that around 96% of the Universe 
should be in the form of energy densities that do not interact electromagnetically. Such 
an odd composition, favoured at such high confidence, has led some to speculate on the 
possibility that General Relativity may not, in fact, be the correct theory of gravity to 
describe the Universe on the largest scales. The dark universe may be just another signal 
that we need to go beyond Einstein's theory 

The idea of modifying gravity on cosmological scales has really taken off over the past 
decade. This has been triggered, in part, by theoretical developments involving higher 
dimensional theories, as well as new developments in constructing renormalizable theories 
of gravity. More phenomenologically, Bekenstein's relativistic formulation of Milgrom's 
Modified Newtonian Dynamics (MoND) has provided a fresh impetus for new study: 
What was previously a rule of thumb for how weak gravitational fields might behave in 
regions of low acceleration, was suddenly elevated to a theory that could be used to study 
cosmology. Insights such as Bertschinger's realisation that large-scale perturbations in 
the Universe can be directly related to the overall expansion rate have also made it 
possible to characterise large classes of theories simply in terms of how they make the 
Universe evolve. Finally, and just as importantly, there has been tremendous progress 
observationally. A key step here has been the measurement of the growth of structure 
at redshifts of z ~ 0.8, by Guzzo and his collaborators. With these measurements 
one can test, and reject, a large number of proposals for modified gravity. This work 
is complemented by many others that carefully consider the impact of modifications 
to gravity on the cosmic microwave background, weak lensing and a variety of other 

6 



cosmological probes. As a result, testing gravity has become one of the core tasks of 
many current, and future, cosmological missions and surveys. 

In this report we aim to provide a comprehensive exposition of the many developments 
that have occurred in the field of modified gravity over the past few decades. We will focus 
on how these theories differ from General Relativity, and how they can be distinguished 
from it, as well as from each other. A vast range of modified theories now exist in the 
literature. Some of these have extra scalar, vector or tensor fields in their gravitational 
sector; some take Sakharov's idea in an altogether new direction, modifying gravity in 
regions of low, rather than high, curvature; others expand on the ideas first put forward 
by Kaluza and Klein, and take them into new realms by invoking new structures. Indeed, 
as the reader will see from our table of contents, there are now a great many possible 
ways of modifying gravity that can, in principle, be tested against the real Universe. We 
will attempt to be as comprehensive in this report as we consider it reasonably possible 
to be. That is, we will attempt to cover as many aspects of as many different theories as 
we can. 

To be able to efficiently assess the different candidate theories of gravity we have opted 
to first lay down the foundations of modern gravitational physics and General Relativity 
in Section [2j We have aimed to make this a self-contained section that focuses, to some 
extent, on why general relativity should be considered 'special' among the larger class of 
possibilities that we might consider. In this section we also survey the current evidence 
for the 'dark universe', and explain why it has become the standard paradigm. From here 
we move on to discuss and compare alternative theories of gravity and their observational 
consequences. While the primary focus of this report is to elucidate particular theories, 
we will also briefly delve into the recent attempts that have been made to construct a 
formalism, analogous to the PPN formalism, for the cosmological arena. We dub these 
approaches 'Parameterised Post Friedmannian'. 

Let us now spell out the conventions and definitions that we will use throughout 
this review. We will employ the 'space-like convention' for the metric, such that when 

it is diagonalised it has the signature ( h ++)■ We will choose to write space-time 

indices using the Greek alphabet, and space indices using the Latin alphabet. Where 
convenient, we will also choose to use units such that speed of light is equal to 1. Under 
these conventions the line-element for Minkowski space, for example, can then be written 

ds 2 = -q^dx^dx" = -dt 2 + dx 2 + dy 2 + dz 2 . (2) 

For the Riemann and Einstein curvature tensors we will adopt the conventions of Misncr, 
Thornc and Wheeler [502] : 

G^v = — -g^R, (4) 

where = R a „ a „ and R — R a a . The energy- momentum tensor will be defined with 
respect to the Lagrangian density for the matter fields as 



where the derivative here is a functional one. Throughout this review we will refer to 
the energy density of a fluid as p, and its isotropic pressure as P. The equation of state, 
w, is then defined by 

P = wp. (6) 

When writing the Friedmann-Lemaitre-Robertson- Walker (FLRW) line-element we will 
use t to denote the 'physical time' (proper time of observers comoving with the fluid), 
and t = J dt/a(t) to denote the 'conformal time' coordinate. Unless otherwise stated, 
when working with linear perturbations about an FLRW background we will work in the 
conformal Newtonian gauge in which 

ds 2 = a 2 (r) [-(1 + 2^)dr 2 + (1 - 2$)q ij dx i dx j ] , (7) 

where qij is the metric of a maximally symmetric 3-space with Gaussian curvature k: 

dr 2 

<fs 2 = qi.dx'dx 1 = 5 + r 2 d6 2 + r 2 sin 2 9d(j) 2 . (8) 

1 — nr l 

When dealing with time derivatives in cosmology we will use the dot and prime operators 
to refer to derivatives with respect to physical and conformal time, respectively, such that 

' - I < 9 > 
' ^ I <*> 

In four dimensional space-time we will denote covariant derivatives with either a semi- 
colon or a V M . The four dimensional d'Alembertian will then be defined as 

n = grv^ v . (11) 

On the conformally static three-dimensional space-like hyper-surfaces the grad operator 
will be denoted with an arrow, as Vj, while the Laplacian will be given by 

A ,fVX,. (12) 

As is usual, we will often make use of the definition of the Hubble parameter defined 
with respect to both physical and conformal time as 

H ee h - (13) 
a 

H EE -. (14) 

a 

The definitions we have made here will be restated at various points in the review, so 
that each section remains self-contained to a reasonable degree. The exception to this 
will be Section [5j on higher dimensional theories, which will require the introduction of 
new notation in order to describe quantities in the bulk. 

Let us now move onto the definitions of particular terms. We choose to define the 
equivalence principles in the following way: 



• Weak Equivalence Principle (WEP): All uncharged, freely falling test particles fol- 
low the same trajectories, once an initial position and velocity have been prescribed. 

• Einstein Equivalence Principle (EEP): The WEP is valid, and furthermore in all 
freely falling frames one recovers (locally, and up to tidal gravitational forces) the 
same laws of special relativistic physics, independent of position or velocity. 

• Strong Equivalence Principle (SEP): The WEP is valid for massive gravitating ob- 
jects as well as test particles, and in all freely falling frames one recovers (locally, 
and up to tidal gravitational forces) the same special relativistic physics, indepen- 
dent of position or velocity. 

Of these, the EEP in particular is known to have been very influential in the conception 
of General Relativity. One may note that some authors refer to what we have called the 
EEP as the 'strong equivalence principle'. 

Let us now define what we mean by 'General Relativity'. This term is often used 
by cosmologists to refer simply to Einstein's equations. Particle physicists, on the other 
hand, refer to any dynamical theory of spin-2 fields that incorporates general covariance 
as 'general relativity', even if it has field equations that are different from Einstein's^ In 
this report when we write about 'General Relativity' we refer to a theory that simulta- 
neously exhibits general covariance, and universal couplings to all matter fields, as well 
as satisfying Einstein's field equations. When we then discuss 'modified gravity' this 
will refer to any modification of any of these properties. However, it will be clear from 
reading through this report that almost all the proposals we report on preserve general 
covariance, and the universality of free fall. Let us now clarify further what exactly we 
mean by 'modified' theories of gravity. 

As we will discuss in the next section, the effect of gravity on matter is tightly 
constrained to be mediated by interactions of the matter fields with a single rank-2 tensor 
field. This does not mean that this field is the only degree of freedom in the theory, but 
that whatever other interactions may occur, the effect of gravity on the matter fields can 
only be through interactions with the rank-2 tensor (up to additional weak interactions 
that are consistent with the available constraints). The term 'gravitational theory' can 
then be functionally defined by the set of field equations obeyed by the rank-2 tensor, 
and any other non-matter fields it interacts with. If these equations are anything other 
than Einstein's equations, then we consider it to be a 'modified theory of gravity'. We 
will not appeal to the action or Lagrangian of the theory itself here; our definition is an 
entirely functional one, in terms of the field equations alone. 

While we have constructed the definition above to be as simple as possible, there are 
of course a number of ambiguities involved. Firstly, exactly what one should consider as 
a 'matter field' can be somewhat subjective. This is especially true in terms of the exotic 
fields that are sometimes introduced into cosmology in order to try and understand 
the apparent late-time accelerating expansion of the Universe. Secondly, we have not 
defined exactly what we mean by 'Einstein's equations'. In four dimensions it is usually 
clear what this term refers to, but if we allow for the possibility of extra dimensions 



Note that under this definition the Einstein-Hilbert and Brans-Dicke Lagrangians, for example, 
represent different models of the same theory, which is called General Relativity. 
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then we may choose for it to refer either to the equations derived from an Einstein- 
Hilbert action in the higher dimensional space-time, or to the effective set of equations 
in four dimensional space-time. Clearly these two possible definitions are not necessarily 
consistent with each other. Even in four dimensions it is not always clear if 'Einstein's 
equations' include the existence of a non-zero cosmological constant, or not. 

To a large extent, the ambiguities just mentioned are a matter of taste, and have no 
baring on the physics of the situation. For example, whether one chooses to refer to the 
cosmological constant as a modification of gravity, as an additional matter field, or as 
part of Einstein's equations themselves makes no difference to its effect on the expansion 
of the Universe. In this case it is only convention that states that the Einstein equations 
with A is not a modified theory of gravity. Although less established than the case 
of the cosmological constant, similar conventions have started to develop around other 
modifications to the standard theory. For example, quintessence fields that are minimally 
coupled to the metric are usually thought of as additional matter fields, whereas scalar 
fields that non-minimally couple to the Einstein-Hilbert term in the action are usually 
thought of as being 'gravitational' fields (this distinction existing despite what numerous 
studies call non-minimally coupled quintessence fields). Although not always clear, we 
try to follow what we perceive to be the conventions that exist in the literature in this 
regard. We therefore include in this review a section on non-minimally coupled scalar- 
tensor theories, but not a section on minimally coupled quintessence fields. 
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2. General Relativity, and its Foundations 

General Relativity is the standard theory of gravity. Here we will briefly recap some of 
its essential features, and foundations. We will outline the observational tests of gravity 
that have been performed on Earth, in the solar system, and in other astrophysical 
systems, and we will then explain how and why it is that General Relativity satisfies 
them. We will outline why General Relativity should be considered a special theory in 
the more general class of theories that one could consider, and will present some of the 
theorems it obeys as well as the apparatus that is most frequently used to parameterise 
deviations away from it. This will be followed by a discussion of the cosmological solutions 
and predictions of the concordance general relativistic ACDM model of the Universe. 

2.1. Requirements for Validity 

In order to construct a relativistic theory of gravity it is of primary importance to 
establish the properties it must satisfy in order for it to be considered viable. These 
include foundational requirements, such as the universality of free fall and the isotropy 
of space, as well as compatibility with a variety of different observations involving the 
propagation of light and the orbits of massive bodies. Today, radio and laser signals 
can be sent back and forth from the Earth to spacecraft, planets and the moon, and 
detailed observations of the orbits of a variety of different astrophysical bodies allow 
us to look for ever smaller deviations from Newtonian gravity, as well as entirely new 
gravitational effects. It is in this section that we will discuss the gravitational experiments 
and observations that have so far been performed in these environments. We will discuss 
what they can tell us about relativity theory, and the principles that a theory must obey 
in order for it to stand a chance of being considered observationally viable. 

2.1.1. The foundations of relativistic theories 

First of all let us consider the equivalence principles. We will not insist immediately 
that any or all of these principles are valid, but will rather reflect on what can be said 
about them experimentally. This will allow us to separate out observations that test 
equivalence principles, from observations that test the different gravitational theories 
that obey these principles - an approach pioneered by Dicke |423j . 

The least stringent of the equivalence principles is the WEP. The best evidence in 
support of the WEP still comes from Eotvos type experiments that use a torsion balance 
to determine the relative acceleration of two different materials towards distant astro- 
physical bodies. In reality these materials are self-gravitating, but their mass is usually 
small enough that they can effectively be considered to be non-gravitating test particles 
in the gravitational field of the astrophysical body. Using beryllium and titanium the 
tightest constraint on the relative difference in accelerations of the two bodies, a± and 
a2, is currently [1110] 

= 2 K-tt 2 | = (Q 3 ± Lg) x 10 _i3 (15) 
\ai+a 2 \ 

This is an improvement of around 4 orders of magnitude on the original results of Eotvos 
from 1922 [472]. It is expected that this can be improved upon by up to a further 5 
orders of magnitude when space based tests of the equivalence principle are performed 
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1282 . These null results are generally considered to be a very tight constraint on the 
foundations of any relativistic gravitational theory if it is to be thought of as viable: The 
WEP must be satisfied, at least up to the accuracy specified in Eq. (15). 

Let us now briefly consider the gravitational redshifting of light. This is one of the 
three "classic tests" of General Relativity, suggested by Einstein himself in 1916 |465j . 
It is not, however, a particularly stringent test of relativity theory. If we accept energy- 
momentum conservation in a closed system then it is only really a test of the WEP, 
and is superseded in its accuracy by the Eotvos experiment we have just discussed. The 
argument for this is the following [4231 [S94 : Consider an atom that initially has an 
inertial mass Mi, and a gravitational mass M g . The atom starts near the ceiling of a 
lab of height h, in a static gravitational field of strength g, and with an energy reservoir 
on the lab floor beneath it. The atom emits a photon of energy E that then travels 
down to the lab floor, such that its energy has been blue-shifted by the gravitational 
field to E' when it is collected in the reservoir. This process changes the inertial and 
gravitational masses of the atom to M[ and M' g , respectively. The atom is then lowered 
to the floor, a process which lowers its total energy by M' g gh. At this point, the atom 
re-absorbs a photon from the reservoir with energy E' = (M" — M[)(? and is then 
raised to its initial position at the ceiling. This last process raises its energy by M' g 'gh, 
where here M" and M'J are the inertial and gravitational masses of the atom after re- 
absorbing the photon. The work done in lowering and raising the atom in this way is 
then w = (M'J — M' g )gh. Recalling that the energy gained by the photon in travelling 
from the lab ceiling to the lab floor is E' — E, the principle of energy conservation then 
tells us that (£" - E) = w = (M'J - M' g )gh. Now, if the WEP is obeyed then M t = M g , 
and this equation simply becomes (E' — E) = E'gh. This, however, is just the usual 
expression for gravitational redshift. Crucial here is the assumption that local position 
invariance is valid, so that both Mj and M g are independent of where they are in the lab. 
If the laws of physics are position independent, and energy is conserved, gravitational 
redshift then simply tests the equivalence of gravitational and inertial masses, which 
is what the Eotvos experiment does to higher accuracy. Alternatively, if we take the 
WEP to be tightly constrained by the Eotvos experiment, then gravitational redshift 
experiments can be used to gain high precision constraints on the position dependence 
of the laws of physics [118] . The gravitational redshift effect by itself, however, does not 
appear to be able to distinguish between different theories that obey the WEP and local 
position invariance. In Dicke's approach it should therefore be considered as a test of 
the foundations of relativistic gravitational theories, rather than a test of the theories 
themselves. 

The next most stringent equivalence principle is the EEP. Testing this is a consid- 
erably more demanding task than was the case for the WEP, as one now not only has 
to show that different test particles follow the same trajectories, but also that a whole 
set of special relativistic laws are valid in the rest frames of these particles. Despite 
the difficulties involved with this, there is still very compelling evidence that the EEP 
should also be considered valid to high accuracy. The most accurate and direct of this 
evidence is due to the Hughes-Drever experiments 633, 433 , which test for local spatial 
anisotropics by carefully observing the shape and spacing of atomic spectral lines. The 
basic idea here is to determine if any gravitational fields beyond a single rank-2 tensor 
are allowed to couple directly to matter fields. To see why this is of importance, let us 
first consider a number of point-like particles coupled to a single rank-2 tensor, g^ v . The 
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Lagrangian density for such a set of particles is given by 



E 



m I y/-g^u^u' y dX, (16) 



where to/ are the masses of the particles, and u^ 1 = dx^/dX is their 4- velocity measured 
with respect to some parameter A. The Euler-Lagrange equations derived from 5C = 



then tell us that the particles in Eq. (16) follow geodesies of the metric <7 M „, and 
Riemannian geometry tells us that at any point we can choose coordinates such that 
9uu = Vtiv locally. We therefore recover special relativity at every point, and the EEP 
is valid. Now, if the matter fields couple to two rank-2 tensors then the argument used 
above falls apart. In this case the Lagrangian density of our particles reads 

C = J2 / [miy/-9^W + n Iy /-h lll/ ui'u l/ dX, (17) 

where is the new tensor, and ri] is the coupling of each particle to that field. The 
particles above can now no longer be thought of as following the geodesies of any one 



metric, as the Euler-Lagrange Equations (17) are not in the form of geodesic equations 



We therefore have no Riemannian geometry with which we can locally transform to 
Minkowski space, and the EEP is violated. The relevance of this discussion for the 



Hughes-Drever experiments is that EEP violating couplings, such as those in Eq. (17), 
cause just the type of spatial anisotropics that these experiments constrain. In this case 
the 4-momentum of the test particle in these experiments becomes 

= i i + i , i , (18) 

and as g^ u and /i M „ cannot in general be made to be simultaneously spatially isotropic, 
we then have that p^ is spatially anisotropic, and should cause the type of shifts and 
broadening of spectral lines that Hughes-Drever-type experiments are designed to de- 
tect. The current tightest constraints are around 5 orders of magnitude tighter than the 
original experiments of Hughes and Drever [765] 1301] , and yield constraints of the order 

n < KT 27 m, (19) 

so that couplings to the second metric must be very weak in order to be observationally 
viable. This result strongly supports the conclusion that matter fields must be coupled to 
a single rank-2 tensor only. It then follows that particles follow geodesies of this metric, 
that we can recover special relativity at any point, and hence that the EEP is valid. 
It should be noted that these constraints do not apply to gravitational theories with 
multiple rank-2 tensor fiel ds t hat couple to matter in a linear combination, so that they 
can be written as in Eq. (16) with g^ v = ^2jCih/w, where cj are a set of I constants. 



Local spatially isotropy, and the EEP, is always recovered in this case. 

Beyond direct experimental tests, such as Hughes-Drever-type experiments, there are 
also theoretical reasons to think that the EEP is valid to high accuracy. This is a con- 
jecture attributed to Schiff, that states 'any complete and self- consistent gravitational 
theory that obeys the WEP must also satisfy the EEP'. It has been shown using conser- 
vation of energy that preferred frame and preferred location effects can cause violations 

13 



of the WEP |594j . This goes some way towards demonstrating Schiff's conjecture, but 
there is as yet still no incontrovertible proof of its veracity. We will not consider the issue 
further here. 

The experiments we have just described provide very tight constraints on the WEP, 
the EEP, and local position invariance. It is, of course, possible to test various other 
aspects of relativistic gravitational theory that one may consider as 'foundational' (for 
example, the constancy of a constant of nature 1240 ). For our present purposes, how- 
ever, we are mostly interested in the EEP. Theories that obey the EEP are often described 
as being 'metric' theories of gravity, as any theory of gravity based on a differentiable 



manifold and a metric tensor that couples to matter, as in Eq. (16), can be shown to 
have test particles that follow geodesies of the resulting metric space. The basics of 
Riemannian geometry then tells us that at every point in the manifold there exists a 
tangent plane, which in cases with Lorentzian signature is taken to be Minkowski space. 
This allows us to recover special relativity at every point, up to the effects of second 
derivatives in the metric (i.e. tidal forces), so that the EEP is satisfied. Validity of the 
EEP can then be thought of as implying that the underlying gravitational theory should 
be a metric one |1273j . 

2.1.2. Observational tests of metric theories of gravity 

In what follows we will consider gravitational experiments and observations that can 
potentially be used to distinguish between different metric theories of gravity. 



Solar system tests 



As well as the gravitational redshifting of light that we have already mentioned, the 
other two 'classic tests' of General Relativity are the bending of light rays by the Sun, 
and the anomalous perihelion precession of Mercury. These can both be considered tests 
of gravitational theories beyond the foundational issues discussed in the previous section. 
That is, each of these tests is (potentially) able to distinguish between different metric 
theories of gravity. As well as these two tests, there are also a variety of other gravitational 
observations that can be performed in the solar system in order to investigate relativistic 
gravitational phenomena. A viable theory of gravity must be compatible with all of 
them. For convenience we will split these into tests involving null trajectories (such as 
light bending) and tests involving time-like trajectories (such as the perihelion precession 
of planets). 

First of all let us consider tests involving null geodesies. As already mentioned, 
the most famous of these is the spatial deflection of star light by the Sun. In General 
Relativity the deflection angle, 9, of a photon's trajectory due to a mass, M, with impact 
parameter d, is given by 

9 = ^r-(l + cos u>)~ 1.75", (20) 
a 

where ip is the angle made at the observer between the direction of the incoming photon 
and the direction of the mass. The 1.75" is for a null trajectory that grazes the limb of the 
Sun. This result is famously twice the size of the effect that one might naively estimate 
using the equivalence principle alone 464 . The tightest observational constraint to date 
on 9 is due to Shapiro, David, Lebach and Gregory who use around 2500 days worth of 
observations taken over a period of 20 years. The data in this study was taken using 87 
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VLBI sites and 541 radio sources, yielding more than 1.7 x 10 6 measurements that use 
standard correction and delay rate estimation procedures. The result of this is [1131 



9 = (0.99992 ± 0.00023) x 1.75", 



(21) 



which is around 3 orders of magnitude better than the observations of Eddington in 1919. 

A further, and currently more constraining, test of metric theories of gravity using 
null trajectories involves the Shapiro time-delay effect |1130j . Here the deflection in time 
is taken into account when a photon passes through the gravitational field of a massive 
object, as well as the deflection in space that is familiar from the lensing effects discussed 
above. The effect of this in General Relativity is to cause a time delay, At, for a light-like 
signal reflected off a distant test object given by 



At = 4M In 



20 12 - In 



(J>s, 



(22) 



where n and r 2 (both assumed 3> d) are the distances of the observer and test object 
from an object of mass M, respectively. The second equality here is the approximate 
magnitude of this effect when the photons pass close by the Sun, and the observer is on 
Earth. Here we have written Rq as the radius of the Sun, and au as the astronomical 
unit. The best constraint on gravity using this effect is currently due to Bertotti, less 
and Tortora using radio links with the Cassini spacecraft between the 6th of June and 
the 7th of July 2002 |147j . These observations result in the constraint 



At = (1.00001 ± 0.00001)At GjR , 



(23) 



where AtcR is the expected time-delay due to general relativity. The Shapiro time-delay 
effect in fact constrains the same aspect of relativistic gravity as the spatial deflection 
of light (this will become clear when we introduce the parameterised post-Newtonian 
formalism later on). This aspect is sometimes called the 'unit curvature' of space. 

Let us now consider tests involving time-like trajectories. The 'classical' test of Gen- 
eral Relativity that falls into this category is the anomalous perihelion precession of 
Mercury (this is called a test, despite the fact that it was discovered long before General 
Relativity |777j ). In Newtonian physics the perihelion of a test particle orbiting an iso- 
lated point-like mass stays in a fixed position, relative to the fixed stars. Adding other 
massive objects into the system perturbs this orbit, as does allowing the central mass to 
have a non-zero quadrupole moment, so that the perihelion of the test particle's orbit 
slowly starts to precess. In the solar system the precession of the equinoxes of the co- 
ordinate system contribute about 5025" per century to Mercury's perihelion precession, 
while the other planets contribute about 531" per century. The Sun also has a non-zero 
quadrupole moment, which contributes a further 0.025" per century. Taking all of these 
effects into account, it still appears that the orbit of Mercury in the solar system has an 
anomalous perihelion precession that cannot be explained by the available visible matter, 
and Newtonian gravity alone. Calculating this anomalous shift exactly is a complicated 
matter, and depends on the exact values of the quantities described above. In Tabic [I] 
we display the observed anomalous perihelion precession of Mercury, Aw, as calculated 
by various different groups. For a more detailed overview of the issues involved, and a 
number of other results, the reader is referred to |1039j . In relativistic theories of gravity 
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Source 


Aw /( " per century) 


Anderson et al. 48J 
Anderson et al. 49J 
Krasinsky et al. |746j: EPM1988 

DE200 

Pitjeva |1040j: EPM1988 
DE200 


42.94 ±0.20 
43.13 ±0.14 
42.984 ±0.061 
42.977 ±0.061 
42.963 ±0.052 
42.969 ± 0.052 



Table 1: The value of the perihelion precession of Mercury obtained from observations by various authors. 
The acronyms EPM1988 and DE200 refer to different numerical cphcmcridcs, which are reviewed in 

[Toll] . 



the additional post-Newtonian gravitational potentials mean that the perihelion of a test 
particle orbiting an isolated mass is no longer fixed, as these potentials do not drop off 
as ~ 1/r 2 . There is therefore an additional contribution to the perihelion precession, 
which is sensitive to the relative magnitude and form of the gravitational potentials, and 
hence the underlying relativistic theory. For General Relativity, the predicted anomalous 
precession of a two body system is given by 

A W = ^ c 42.98", (24) 
V 

where m is the total mass of the two bodies, and p is the semi-latus rectum of the orbit. 
The last equality is for the Sun-Mercury system, and is compatible with the observations 
shown in Table[l] Each relativistic theory predicts its own value of Aw, and by comparing 
to observations such as those in Table [T] we can therefore constrain them. This test is an 
additional one beyond those based on null geodesies alone as it tests not only the 'unit 
curvature' of space, but also the non-linear terms in the space-time geometry, as well as 
preferred frame effects. 

Another very useful test involving time- like geodesies involves looking for the 'Nordtvedt 
effect' 986 . This effect is the name given to violations of the SEP. In the previous section 
we only considered tests of the WEP and EEP, which provide strong evidence that viable 
gravitational theories should be 'metric' ones. Now, it is entirely possible to satisfy the 
WEP and EEP, with a metric theory of gravity, while violating the SEP. Such violations 
do not occur in General Relativity, but do in most other theories. Every test of the 
Nordtvedt effect is therefore a potential killing test of general relativity, if it delivers a 
non-null result. To date, the most successful approach in searching for SEP violations 
is to use the Earth-Moon system in the gravitational field of the Sun as a giant Eotvos 
experiment. The difference between this and the laboratory experiments described in 
the previous section is that while the gravitational fields of the masses in WEP Eotvos 
experiments are entirely negligible, this is no longer the case with the Earth and Moon. 
By tracking the separation of the Earth and Moon to high precision, using lasers reflected 
off reflectors left on the Moon by the Apollo 11 mission in 1969, it is then possible to 
gain the constraint |1277j 

r)= (-1.0 ±1.4) X 10" 13 , (25) 



where t\ is defined as in Eq. (15). This is indeed a null result, consistent with General 
Relativity, and is tighter even than the current best laboratory constraint on the WEP. It 
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can therefore be used to constrain possible deviations from General Relativity, and in fact 
constrains a similar (but not identical) set of gravitational potentials to the perihelion 
precession described previously. 

A third solar system test involving time-like geodesies is the observation of spinning 
objects in orbit. While currently less constraining than the other tests discussed so 
far, these observations allow insight into an entirely relativistic type of gravitational be- 
haviour: gravitomagnetism. This is the generation of gravitational fields by the rotation 
of massive objects, and was discovered in the very early days of General Relativity by 
Lense and Thirring [1204, 785 . The basic idea here is that massive objects should 'drag' 
space around with them as they rotate, a concept that is in good keeping with Mach's 
principle. Although one can convincingly argue that the same aspects of the gravita- 
tional field that cause frame-dragging are also being tested by perihelion precession and 
the Nordtvedt effect, it is not true that in these cases the gravitational fields in question 
are being communicated through the rotation of matter. Now, in the case of General 
Relativity it can be shown that the precession of a spin vector S along the trajectory 
of a freely-falling gyroscope in orbit around an isolated rotating massive body at rest is 
given by 

dS 



where 



ft x S, (26) 

CIT 

xV[/-|vxg. (27) 

Here we have written the vector g = g ai , and have taken v and U to be the velocity of 
the gyroscope and the Newtonian potential at the gyroscope, respectively. The first term 



in (27) is called 'geodetic precession', and is caused by the 'unit curvature' of the space. 



This effect exists independent of the massive bodies rotation. The second term in (27) 
is the Lense-Thirring term, and causes the frame-dragging discussed above. The most 
accurate measurement of this effect claimed so far is at the level of 5 — 10% accuracy, 
and has been made using the LAser GEOdynamics Satellites (LAGEOS) 302 (there 
has, however, been some dispute of this result 641, 642 ). The Gravity Probe B mission 
is a more tailor made experiment which was put in orbit around the Earth between April 
2004 and September 2005. The current accuracy of results from this mission are at the 
level of ~ 15% |476j . although this could improve further after additional analysis is 
performed. 

All of the tests discussed so far in this section have been for long-ranged modifications 
to Newtonian gravity. As well as these, however, there are a host of alternatives to Gen- 
eral Relativity that also predict short-ranged deviations from 1 /r 2 gravity. These range 
from extra-dimensional theories [673[ 1702) , to fourth-order theories |307) and bimetric 
theories [308] , all of which predict 'Yukawa' potentials of the form 

^qf^^, (28) 



\x — x'\ 

where a parameterises the 'strength' of the interaction, and A parameterises its range. 
The genericity of these potentials, often referred to as 'fifth-forces', provides strong mo- 
tivation for experimental attempts to detect them. Unfortunately, due to the their scale 
dependence, one can no longer simply look for the extra force on one particular scale, and 
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then extrapolate the result to all scales. Instead, observations must be made on a whole 
range of different scales, so that we end up with constraints on a at various different val- 
ues of A. These observations are taken from a variety of different sources, with the scale of 
the phenomenon being observed typically constraining A of similar size. So, for example, 
on the larger end of the observationally probed scale we have planetary orbits [1197] and 
lunar laser ranging [1277 j constraining a < 10 -8 between 10 8 m < A < 10 12 m. On inter- 
mediate scales the LAGEOS satellite, and observations of gravitational accelerations at 
the top of towers and under the oceans provide constraints of a that range from a < 10 -8 
at A - 10 7 m [T055] to a < 1(T 3 at lCT 1 ™ < A < 10 4 m gSHl H5T2j . At smaller scales 
laboratory searches must be performed, and current constraints in this regime range 
from a < 1(T 2 at A - lCT 2 m, to a < 10 6 at A - lCT 5 m [6211 EM EH] ■ Weaker con- 
straints at still smaller scales are available using the Casimir effect. For a fuller discussion 
of these searches, and the experiments and observations involved, the reader is referred 
to the reviews by Fischbach and Talmadge [507] , and Adelberger, Heckel and Nelson [TT] . 

Gravitational waves, and binary pulsars 

A generic prediction of all known relativistic theories of gravity is the existence of 
gravitational waves: Propagating gravitational disturbances in the metric itself. How- 
ever, while all known relativistic gravitational theories predict gravitational radiation, 
they do not all predict the same type of radiation as the quadrupolar, null radiation 
that we are familiar with from General Relativity. It is therefore the case that while 
the mere existence of gravitational radiation is not itself enough to effectively discrimi- 
nate between different gravitational theories, the type of gravitational radiation that is 
observed is. The potential differences between different types of gravitational radiation 
can take a number of different forms, which we will now discuss. 

Firstly, one could attempt to determine the propagation speed of gravitational waves. 
In General Relativity it is the case that gravitational waves have a velocity that is strictly 
equal to that of the speed of light in vacuum. Generically, however, this is not true: Some 
theories predict null gravitational radiation, and others do not. So, for example, if one 
were able to detect gravitational waves from nearby supernovae, then comparing the 
arrival time of this radiation with the arrival time of the electromagnetic radiation would 
provide a potentially killing test of General Relativity. There are, however, a number 
of different theories that predict null gravitational radiation. Tests of the velocity of 
gravitational waves therefore have the potential to rule out a number of theories, but by 
themselves are not sufficient to distinguish any one in particular. 

A second, more discriminating test, is of the polarity of gravitational radiation. Gen- 
eral Relativity predicts radiation with helicity modes ±2 only, and so far is the only 
proposed theory of gravity that does so. In general, there are six different polarisation 
states - one for each of the six 'electric' components of the Riemann tensor, Roioj- These 
correspond to the two modes familiar from General Relativity, as well as two modes 
with helicity ±1, and two further modes with helicity 0. One of these helicity-0 modes 
corresponds to an additional oscillation in the plane orthogonal to the wave vector k^, 
while the remaining 3 modes all correspond to oscillations in a plane containing k^ . The 
extent to which observations of these modes can constrain gravitational theory depends 
on whether or not the source of the radiation can be reliably identified. If the source 
can be identified, then the vector fc M is known, and one should then be able to uniquely 
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identify the individual polarisation modes discussed above. We then have 6 different 
tests of relativistic gravitational theory - one for each of the modes. In the absence of 
any knowledge of fc M , however, one cannot necessarily uniquely identify all of the modes 
that are present in a gravitational wave, although it may still be possible to constrain 
the modes being observed to a limited number of possibilities. 

Direct observations of gravitational waves, of the kind discussed above, provide an 
excellent opportunity to further constrain gravity. Indeed, some theories can be shown 
to be indistinguishable from General Relativity using post-Newtonian gravitational phe- 
nomena in the solar system alone, while being easily distinguishable when one also con- 
siders gravitational radiation. This is the case with Rosen's bimetric theory of gravity 
|1068[ 1779"! 11275] . To date, however, the direct detection of gravitational radiation has 
yet to be performed. At present the highest accuracy null-observations of gravitational 
radiation are those of the Laser Interferometer Gravitational- wave Observatory (LIGO). 
This experiment consists of two sites in the USA (one in Livingston, Louisiana and one in 
Richland, Washington) . Each site is an independent interferometer constructed from two 
4 km arms, along which laser beams are shone. The experiment has an accuracy capable 
of detecting oscillations in space at the level of ~ 1 part in 10 21 , but has yet to make a 
positive detection. Further experiments are planned for the future, including Advanced 
LIGO, which is scheduled to start in 2014, and the Laser Interferometer Space Antenna 
(LISA). Both Advanced LIGO and LISA are expected to make positive detections of 
gravitational waves. 

Another way to search for gravitational waves is to look for their influence on the 
systems that emitted them. In this regard binary pulsar systems are of particular interest. 
Pulsars are rapidly rotating neutron stars that emit a beam of electromagnetic radiation, 
and were first observed in 1967 604 . When these beams pass over the Earth, as the 
star rotates, we observe regular pulses of radiation. The first pulsar observed in a binary 
system was PSR B1913+16 in 1974, by Russell Hulse and Joseph Taylor [634] ■ This 
is a particularly 'clean' binary system of a pulsar with rotational period ~ 59ms in 
orbit around another neutron star. Binary pulsars are of particular significance for 
gravitational physics for a number of reasons. Firstly, they can be highly relativistic. 
The Hulse- Taylor binary system, for example, exhibits a relativistic periastron advance 
that is more than 30 000 times that of the Mercury-Sun system. In this regard they 
provide an important compliment to the observations of post-Newtonian gravity that we 
observe in the solar system. Secondly, they are a source of gravitational waves. Given 
the high degree of accuracy to which the orbits of these systems are known, the change 
in angular momentum due to gravitational radiation can be determined and observed. 
In the Hulse- Taylor system the observed decrease in orbital period over the past 30 years 
is 0.997 ±0.002 of the rate predicted by General Relativity 1267 . Finally, neutron stars 
are composed of a type of compact matter that is of particular interest for the study of 
self-gravitational effects. For a review of pulsars in this context the reader is referred to 
[TT77j . 

There are large number of relativistic parameters that can be probed by observations 
of binary pulsar systems .359 . To date, however, the most constrained are the 5 'post- 
Keplerian' effects, which are the rate of periastron advance, the rate of change of orbital 
period, the gravitational redshift, and two Shapiro time-delay effects. These effects are 
familiar from the solar system tests discussed above, apart from the change in orbital 
period that is negligible in the solar system. One further effect that has been measured 
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only relatively recently is the 'geodetic' precession of the pulsar spin vector about its 
angular momentum vector [744] . This is a purely relativistic effect that is observed via 
changes in the observed pulse profile over a period of time that can be attributed to 
our line of sight to the pulsar crossing the emitting region at varying positions due to 
the precession. The determination of the precession rate using these observations is, 
however, complicated somewhat by a degeneracy between the a priori unknown shape of 
the emitting region and the geometry of the system as a whole [1268, 1314] . 

Not all of the post-Keplerian effects are always apparent in any given binary system, 
and not all provide independent tests of gravity. For example, in the Hulse- Taylor binary 
only three of these effects can be observed (the inclination angle of the system on the 
sky is too large to observe any significant Shapiro delay), and there are two unknown 
quantities in the system (the masses of the pulsar, and that of its companion) . The Hulse- 
Taylor binary therefore provides only 3 — 2 = 1 test of relativistic gravity. The recently 
discovered 'Double Pulsar' PSR J0737-3039A/B [840], however, does significantly better 
|745j . All five post-Keplerian effects are visible in this system, and because both neutron 
stars are observable as pulsars the ratio of their masses can be directly inferred from their 
orbits. This leaves only one unknown quantity, and hence gives 5 — 1 = 4 independent 
tests of relativistic gravity. So far, all binary pulsar tests of gravity, including those of 
the double pulsar, are consistent with General Relativity. 

Finally, let us return to constraining gravitational theory through the emission of 
gravitational waves. The effect of emitting gravitational radiation from a binary system 
is to change its orbital period. In General Relativity we know that only quadrupole 
radiation with positive energy should be emitted from a system. For most relativistic 
theories, however, dipole gravitational radiation is also expected, and sometimes this can 
carry away negative energy. The existence of dipole radiation is sometimes attributed 
to violations of the SEP, whereby the centre of the mass responsible for gravitational 
radiation is no longer the same as the centre of inertial mass. If the centre of inertial mass 
is what stays fixed, then the centre of mass responsible for the gravitational radiation can 
move and generate dipole radiation. Dipole radiation is expected to be most dominant in 
binary systems with high eccentricity, and where the companion mass is a white dwarf. 
No evidence for dipolar radiation yet exists (154[ I776j . Null observations that attest 
to this result therefore allow for experimental limits to be set on theories that predict 
positive energy dipolar radiation. The lack of any observation of dipolar radiation can 
also be used to rule out with high confidence theories that allow negative energy dipolar 
radiation, such as Rosen's theory |1275j . 

2.1.3. Theoretical considerations 

As he developed the Special Theory of Relativity, it is often assumed that Einstein's 
inspiration came from experiments pointing towards the constancy of the speed of light. 
It is true that he was certainly aware of these experiments, but he was also inspired by 
theory, specifically his faith in the principle of relativity and the validity of Maxwell's 
equations in any inertial frame. So too, in developing models of modified gravity, we 
should not only take our lead from observation but also from theory. Indeed, theoretical 
considerations are a very powerful tool in testing new models. Typically these involve 
the study of classical and quantum fluctuations about classical solutions. Do the clas- 
sical fluctuations propagate super-luminally? Can we excite a ghost? Do the quantum 
fluctuations become strongly coupled at some unacceptably low energy scale? 
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Ghosts 



Ghosts are a common feature of many modified gravity models that hope to explain 
dark energy. Intuitively it is easy to see why this might be the case. To get cosmic 
acceleration we need an additional repulsive force to act between massive objects at 
large distances. If this force is to be mediated by a particle of even spin, such as a scalar 
(spin 0) or a tensor (spin 2), then the kinetic term describing this must have the "wrong" 
sigrj^] that is, it must be a ghost. 

We should be clear about the distinction between the kind of ghost that arises in 
certain modified gravity models and the Faddeev-Popov ghost used in the quantisation 
of non-abelian gauge theories. The latter is introduced in the path integral to absorb 
unphysical gauge degrees of freedom. It does not describe a physical particle and can 
only appear as an internal line in Feynman diagrams. In contrast, the ghosts that haunt 
modified gravity describe physical excitations and can appear as external lines in Feynman 
diagrams. 

When a physical ghost is present one has a choice: Accept the existence of negative 
norm states and abandon unitarity, or else accept that the energy eigenvalues of the ghost 
are negative [317] . Since the former renders the entire quantum description completely 
non-sensical, one usually accepts the latter. However, it now follows that the ghost 
will generate instabilities if it couples to other, more conventional, fields. When these 
fields are already excited, the ghost can and will continually dump its energy into the 
"conventional" sector through classical processes, since its energy is unbounded from 
below. Even in vacuum, one will get the spontaneous (quantum) production of ghost- 
non-ghost pairs, and in a Lorentz invariant theory, the production rate is divergent |317j . 

There are a few ways to try to exorcise the ghost. One is to isolate it somehow, 
such that it completely decouples from other fields. Another option is to make it heavy, 
so much so that its mass exceeds the cut-off for the effective theory describing the rel- 
evant fluctuations, and one can happily integrate it out. A third option is to break 
Lorentz invariance, perhaps spontaneously, so that one can introduce an explicit Lorentz 
non-invariant cut-off to regulate the production rate of ghost-non-ghost pairs (see, for 
example, [653 ). However, perhaps the safest way to deal with a ghost is to dismiss as 
unphysical those solutions of a theory upon which the ghost can fluctuate. This school 
of thought is exploited to good effect in the ghost condensate model [590] . 

Strong coupling 

Some modified gravity models are said to suffer from "strong coupling" problems. 
Given a classical solution to the field equations, this refers to quantum fluctuations on 
that solution becoming strongly coupled at an unacceptably low scale. For example, in 
DGP gravity, quantum fluctuations on the Minkowski vacuum becomes strongly coupled 
at around A ~ 10~ 13 eV ~ l/(1000km). In other words, for scattering processes above 
A, perturbative quantum field theory on the vacuum is no longer well defined, and one 



2 In our conventions, the Lagrangian for a canonical scalar is C = —-^(dip) 2 , whereas a ghost has 
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must sum up the contribution from all the multi-loop diagrams. One then has complete 
loss of predictivity. Furthermore, the classical solution itself is meaningless at distances 
below A -1 since it would require a scattering process involving energies above the cut-off 
to probe its structure. 

The strong coupling scale is, of course, dependent on the background classical solution, 
and may even depend on position in space-time. Whether the inferred strong coupling 
scale is acceptable, or not, again depends on the background. For example, strong 
coupling at 1000 km on the Minkowski vacuum of DGP gravity is not really an issue as 
Minkowski space does not represent a good approximation to the classical solution in the 
vicinity of the Earth. Indeed, for the classical solutions sourced by the Earth to leading 
order, quantum fluctuations will become strongly coupled at some scale that depends on 
the radial distance from the Earth's centre. Computed at the Earth's surface one should 
require that this lies below an meV since quantum gravity effects have yet to show up in 
any lab based experiments up to this scale. 

It has actually been argued that strong coupling on the vacuum can be a virtue 
in modified gravity models [447] . This is because it can be linked to a breakdown of 
classical perturbation theory, which is necessary for the successful implementation of the 
Vainshtein mechanism [12411 1399] . We discuss the Vainshtein mechanism and strong 
coupling in some detail in the context of DGP gravity in Section |5.5.4| Here we will 
make some generic statements. Consider a model of gravity that deviates from GR at 
large distances. To be significant in terms of understanding dark energy, this deviation 
must be at least 0(1) on cosmological scales, but be suppressed down to < 0(1O~ 5 ) on 
Solar System scales. Therefore, the field or fields that are responsible for the modification 
must be screened within the Solar System. How can this screening occur? One way is for 
the fields to interact so strongly that they are frozen together, so much so that they are 
unable to propagate freely. This is the idea behind the Vainshtein mechanism - higher 
order derivative interactions help to suppress the extra modes near the source (the Sun) . 

Alternative ways to screen the extra fields have been suggested in the form of the 
chameleon 689, 688^, and the symmetron |608j mechanisms. Both methods exploit the 
dependence of the effective potential on the environment. For the chameleon, the mass of 
the field is environmentally dependent, getting heavy in the Solar System. For the sym- 
metron, the strength of the matter coupling is (indirectly) environmentally dependent, 
tending to zero near a heavy source. 

2.2. Einstein's Theory 

Having considered the requirements that must be satisfied by a viable relativistic 
theory of gravity, let us now consider Einstein's theory of General Relativity in particular. 
General Relativity satisfies all of the requirements described in the previous section, either 
by construction (for the foundational requirements) or by trial (in the case of tests of 
metric theories of gravity) . 

General Relativity is a gravitational theory that treats space-time as a 4-dimensional 
manifold. The connection associated with covariant differentiation, a , should be 
viewed as an additional structure on this manifold, which, in general, can be decomposed 
into parts that are symmetric or antisymmetric in its last two indices: 



r'' 



a/3 — 



(a/3) 



[a/3]" 



(29) 
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In General Relativity we take „ = 0, or, in the language of differential geometry, 
we assume that torsion vanishes. We are then left with only the symmetric part of the 
connection, which describes the curvature of the manifold. 

Now, to define distances on the manifold one also requires a metric tensor, g^ v . Along 
the curve 7 this gives the measure of distance 

dXy/g^xfx", (30) 

where A is a parameter along the curve, x M = x M (A), and over-dots here mean dif- 
ferentiation with respect to A. The metric should also be considered as an additional 
structure on the manifold, which is in general independent from the connection. The 
relationship between the connection and the metric is defined via the non-metricity ten- 
sor, Qfta/s = V M 5 Q( 9. In General Relativity it is assumed that the non-metricity tensor 
vanishes. We can now use the metric to define the Levi-Civita connection, which has 
components given by the Christoffel symbols: 

r % = = (flW + g^,a - g a p,») • (31) 

To summarise, as a consequence of the two assumptions Qu a /3 — and T^r fll = , 



the components of the connection are uniquely given by the Christoffel symbols via (31 1, 
and so the connection, and all geometric quantities derived from it, are defined entirely 
in terms of the metric. In General Relativity, therefore, the metric tells us everything 
there is to know about both distances and parallel transport in the space-time manifold. 

The resulting set of structures is known as a Riemannian manifold (or, more accu- 
rately, pseudo-Ricmannian in the case where the metric is not positive definite, as is 
required to recover special relativity in the tangent space to a point in space-time) . Rie- 
mannian manifolds have a number of useful properties including tangent vectors being 
parallel to themselves along geodesies, the geodesic completeness of space-time implying 
the metric completeness of space-time, and a particularly simple form for the contracted 
Bianchi identities: 

v M (sr - \gr*R^ = 0, (32) 

where and R = g^ v R^ v are the Ricci tensor and scalar curvature, respectively . This 
last equation is of great significance for Einstein's equations. 

2.2.1. The field equations 

Having briefly discussed the geometric assumptions implicit in General Relativity, let 
us now display the field equations of this theory: 

Rpu - ^QnuR = - g MV A. (33) 

Here T M „ is the energy-momentum tensor of matter fields in the space-time, and A is the 
cosmological constant. These equations are formulated such that energy-momentum is 
a conserved quantity (due to the contracted Bianchi identity and metric-compatibility 
of the connection), so that special relativity can be recovered in the neighbourhood of 
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every point in space-time (up to tidal forces), and so that the usual Newtonian Poisson 
equation for weak gravitational fields is recovered in non-inertial frames kept at a fixed 
space-like distances from massive objects (up to small corrections). 



The Field Equations (33 1 are a set of 10 generally covariant, quasi-linear second-order 
PDEs in 4 variables, for the 10 independent components of the metric tensor. They 
constitute 4 constraint equations and 6 evolution equations, with the contracted Bianchi 
identities ensuring that the constraint equations are always satisfied. Furthermore, the 
conserved nature of and the Riemannian nature of the manifold ensure that the 
WEP and EEP are always satisfied: Massless test particles follow geodesies, and in any 
freely falling frame one can always choose 'normal coordinates' so that local space-time 
is well described as Minkowski space. 

2.2.2. The action 



As with most field theories, the Field Equations ( 33 ) can be derived from the variation 



of an action. In the case of General Relativity this is the Einstein-Hilbert action: 

S = ^ / y/=g(R - 2A)d 4 x + J C m (g^^)d 4 x, (34) 

where C m is the Lagrangian density of the matter fields, ip, and the gravitational La- 
grangian density has been taken to be C g = yJ—g(R — 2A)/167rG. Let us now assume 
the Ricci scalar to be a function of the metric only, so that R = R(g). Variation of Eq. 



( 34 ) with respect to the metric tensor then gives the Field Equations ( 33 1 , where 



T^= ; 2 _ SCm . (35) 
V -.9 oguu 



The factors of y/—g are included in Eq. (34) to ensure that the Cs transform as scalar 



densities under coordinate transformations, i.e. as 

L = det {w) £ ' (36) 

under coordinate transformations — x^^x"). This property ensures S is invariant 
under general coordinate transformation, and that the resulting tensor field equations are 
divergence free (i.e. the contracted Bianchi identities and energy-momentum conservation 
equations are automatically satisfied). 

We have outlined here how Einstein's equations can be obtained from the variation of 
an invariant action with respect to the metric, once it has been assumed that the space- 
time manifold is Riemannian. The vanishing of torsion and non-metricity then tell us that 
the metric is the only independent structure on the manifold, and the invariant action 
principle ensures that we end up with a set of tensor field equations in which energy- 
momentum is conserved. Because of this formulation the WEP and EEP are satisfied 
identically. Now, when considering alternative theories of gravity one often wants to 
modify the field equations while conserving these basic properties. Modified theories of 
gravity are therefore often formulated in a similar way; from the metric variation of an 
invariant action principle under the assumption of Riemannian geometry, with a universal 
coupling of all matter fields to the same metric. 
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2.3. Alternative Formulations 

The discussion in the previous section involved deriving Einstein's equations under 
the a priori assumption of Riemannian geometry (i.e. assuming to begin with that the 
torsion vanishes and that the connection is metric compatible) . In this case the metric is 
the only remaining geometric structure, and a simple metric variation of the action is the 
only option. We can, however, be less restrictive in specifying the type of geometry we 



wish to consider. For the case of the Einstein-Hilbert action, Eq. (34 1, this usually still 



leads to the Einstein Equations ( 33 ) . For alternative theories of gravity, however, this is 
often not the case: Different variational procedures, and different assumptions about the 
geometric structures on the manifold, can lead to different field equations. It is for this 
reason that we now outline some alternative formulations of General Relativity. A large 
collection of many such formulations can be found in [1032 . 

2.3.1. The Palatini procedure 

The most well known deviation from the metric variation approach is the 'Palatini 
procedure' ( 1020 ] . Here the connection is no longer immediately assumed to be metric 
compatible, but is still assumed to be symmetric and thus torsionless. In addition, all 
matter fields are still taken to couple universally to the metric onljj^] The action to be 
varied is then 

S = ^ J 7=ff [gT r R fW - 2A] d A x + J C m (g^,^)d 4 x, (37) 

where 1 R^ v indicates that the Ricci tensor here is defined with respect to the connec- 
tion and not the metric (at this stage the metric and connection are still independent 
variables), and is given by 

1 R^u = d a T a ^ — dfj,T a au + r paT ^ — r^r^- (38) 

The Ricci tensor defined above, as well as the Einstein tensor derived from it, are in gen- 
eral asymmetric. However, they become symmetric as soon as we assume the connection 



is symmetric. Variation of Eq. (37) with respect to the connection gives the condition 
that the connection is in fact the Levi-Civita connection. Variation with respect to the 
metric then recovers the Einstein equations. 

If the torsionless condition on the connection is dropped then complications arise, as 
has been shown by Hehl and Kerlick [600] . The general form of the connection can be 
shown to be given by 

r\s = \ Ho\ + J*aB = [ ? a \ + K \b + L\ 8 - (39) 



a ? ~ \a(3j T °f> ~ lap J a P "*> 

The tensor field K fl a o is the contorsion tensor, that can be defined in terms of the 
antisymmetric components of the connection, known as the torsion, as 

K" a p^S%- S ap »- Sp/, (40) 



3 This assumption has limited validity, however, as it cannot be applied to tensor fields without using 
a covariant derivative. 
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where a = r M , „, is the torsion tensor. The tensor field L M „ is defined in terms of 

up [ap\ up 

the non-metricity tensor as 

£%^(Q%-Q./- Q p S)- (4i) 

To avoid confusion, we continue to denote the covariant derivative associated with the 
Levi-Civita connection as V^ t , while we use r V M to denote the covariant derivative 
associated with T^ a g. 

Varying the action with respect to the metric g^ v we find the analogue of the Einstein 
equations: 

Gh+Av^^G^. (42) 

One should note that only the symmetric part of the Einstein tensor appears here, and 
that we have used T„ v rather than T„ v to emphasise the fact that T„ v is defined at 
constant T M Q ^ in the variation, i.e. T M „ = — | r - On the other hand T^„ = 

— ^/=|^j|j- This is not an important distinction at this stage, as we have assumed 
that matter field do not couple to the connection, and hence Tn V = T„ v . It will, however, 



be important in the following subsection and in Section [3.5.1 

tensor as P. a P = H 

-9 />i" a)3 



Varying with respect to the connection defines the Palatini tensor as P^ a ^ — 7^= S ^^J' R ^ 



that can be written as 

Pap.? = S^ap + 2g fl [ a Sp] + g^[ a Q/3] ~ 9n[aQ v ]p , (43) 

where — S a „ a , and where we have split the non-metricity tensor into trace and 
traceless parts as Q^ a p — Q^gup + Q^ap, with g a/3 Q f _ ia p — 0. The Palatini tensor has 
only 60 independent components because it is identically traceless: P a ^ a = 0. Now, the 
second field equation is the vanishing of the Palatini tensor, 

P^ = 0, (44) 

but this provides only 60 constraints among the 64 independent components of the con- 
nection. In fact it may be shown that the equation P^ a ^ — is equivalent to the 
connection taking the following form |600j : 

r"a fl = 1^1 - \qJT b = W + %s a P B . (45) 



Clearly then, there are 4 degrees of freedom left undetermined by the field equations. 
Thus the Palatini approach in its most general form does not lead to a unique set of field 
equation^] 

The constraint = is sufficient to produce a consistent theory. This, however, has 
to be imposed as a Lagrange multiplier in the action via a term J d 4 Xy/—g\ a Q a . Once 
this is done, one recovers General Relativity uniquely. For theories of gravity other than 
General Relativity the difference between the metric variation and the Palatini procedure 
is even more significant: The resulting field equations are, in general, different. This will 
be spelt out explicitly for some specific theories in the sections that follow. 



4 It is often said that the Palatini procedure uniquely recovers GR. As we have seen, however, this 
is a myth. It does so only after further assumptions, for instance that the torsion vanishes, or that the 
connection is metric compatible, or that Q a = 0. To make the Palatini variation well defined one has 
to impose such conditions in the action by means of Lagrange multipliers. 
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2.3.2. Metric-affine gravity and matter 

A further generalisation of the metric variation approach is to keep the metric and 
connection completely independent, as discussed above, and further allow matter to 
couple not only to the metric, but also the connection [600] . In this case the action takes 
the form 

S = [ V=9W r R,u - 2A)d i x + [ C m (g^,r\g, ^x, (46) 



167rG J *"■*■)>* ^ J "-'"Wf"" 1 a/3 

where r M Q ^ and g^ are once again independent. Performing the variations we recover 
Eq. ( 42 ) as before, and 

P^ = 8ttGA^, (47) 



where A M Q ^ = — ^= -^P* is called the hypermomentum tensor [6001 1601] . 
In this case T„ u ^ T, LV1 but it is straightforward to find that 



Tiii* -Tin/ -\- V 



A", , - A, p , - A, ' 



(48) 



where is the covariant derivative associated with the Levi-Civita connection. 



Equation (47 1 can be shown to be self-inconsistent for reasonable forms of matter, 
as the Palatini tensor is invariant under projective transformations of the form r A ' Q(9 — > 
T fl a p + XaS^p, while there is no reason to suspect this invariance is exhibited by the 
matter fields and hence the hypermomentum. Equivalently, the Palatini tensor obeys 
the identity P afi a — 0, while there is no reason that this should identically hold for the 
hypermomentunj^] One way to impose self-consistency is to demand that both torsion 
and non-metricity must vanish (by using Lagrange multipliers in the action), leading 
again to General Relativity. This type of self-consistency is, however, very strong, and 
weaker constraints have been found in [600] . One such weaker constraint leads to the 
Einstein-Cartan-Sciama-Kibble theory [601] . that we shall briefly describe in Section 

EXU 

2.3.3. Other approaches 

There are a variety of other formalisms that one can use to derive Einstein's equations. 
We will not go into the full details of all of these here, but merely mention some of the 
approaches that exist in the literature. For brevity we will only consider vacuum general 
relativity here. 

In the l vierbein' formalism the Einstein- Hilbert action can be written 

S=[d 4 xee^R^, (49) 



5 Consider for example a simple Einstein- iEther model for which the matter action is Sm = 
/ d 4 xy/=g [aV M A" V* A^ + A(A M A M + 1)]. The hypermomentum is A M Q/3 = -2A' 3 V M A° ! which clearly 
does not obey A afi a = 0. The variation done this way is inconsistent. On the other hand using the 
Lagrange constraint J d^x^f—gfi^^ ' in the action imposes J^ a p = 0, and hence the vanishing of 

the Palatini tensor. This leads to a modified Eq. |47) , as P M "' 3 = = 8iy G(A . u a ^ — /3 fl a ' 3 ), and to a 
modified Eq. |42| , which now includes derivatives of After using Eq. |48| , however, the resulting 

equations are completely equivalent to the metric variation. 
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where indices with hats correspond to a basis in the tangent space defined by the set 
of contravariant vectors, e. , with determinant e = det[e^ M ]. The inverse of e~ is e^, 

such that e p ' i e- p = (5 M -, and e^e u p = S^ u . The metric tensor is constructed as g^ u — 

rj^oe^e^ . The spin connection to"" then defines a space-time and Lorentz covariant 



derivative, X> M , as T>^v^ — V^w^ + cj p jV^ : where V M is the Levi-Civita connection 6 
curvature tensor R^" 13 is defined in terms of the spin connection as 



The 



R^ = W - + ^ " - u v »o u Mp \ (50) 

Variation now proceeds as in the Palatini formalism by assuming that the spin connection 
and vierbein are independent fields, from which one obtains the two field equations 

= 0, (51) 

and 

G% ee etefffiR^ - \{e^R^)e? = 0, (52) 



where is the Einstein tensor. Equation (51 1 can be used to obtain the spin connection 



in terms of the partial derivatives of the vierbein, and the resulting relation implies that 
u; M a/3 is torsion-less, i.e. one recovers Cartan's first structure equation, de^-l-a/'- Ae" = 0. 
The second equation says that the vacuum Einstein equations are recovered. 

Another interesting alternative formulation of General Relativity is given by the Ple- 
banski formalism [11032] . It is derived from the action 

S = J Yi AB A Rab - \^abcd^ AB A S CD , (53) 

where upper case indices denote two component spinor indices to be raised and lowered 
with e AB and its inverse, and where the exterior product A acts on space-time indices, 
which have been suppressed. The curvature 2-form Rab = cIujab +u A ° Aujcb is defined 
with respect to a spin connection 1-form oj a b . Variation of this action with respect to 
* abcd and u>ab then tells us that the 2-form T, AB is the exterior product of some set 
of 1-forms that we can identify with the tetrad 9 AA , and that the connection luab 1S 
torsion-free with respect to Y> AB . Using this together with the variation of the action 
with respect to T, AB then gives the vacuum Einstein equations, where the metric is given 
by g = 9 AA '®0 AA ,. 

One further alternative formulation of General Relativity is the purely affinc 'Edding- 
ton formalism' |460j . In previous subsections we outlined how one can either treat the 
metric as the only independent structure on the manifold, or treat the metric and con- 
nection as being two independent structures. Another approach is to take the connection 
as the only structure on the manifold. In this case, the simplest way of constructing a 
Lagrangian density with the correct weight (and without a metric) is to simply take the 
square root of the determinant of the Ricci tensor itself: 



S 



yj ' -detlR^Y^x. (54) 



6 Given a metric, g^v, the Levi-Civita connection can always be defined. The question is whether 
that is the connection that is used to define parallel transport. 
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Varying this action with respect to the connection then gives the field equations 



which can be shown to be equivalent to Einstein's equations in vacuum with a cosmolog- 
ical constant, if we take the connection to be the Levi-Civita connection. Due to the lack 
of a metric in the action for this theory, however, it is not a trivial matter to introduce 
matter fields into the theory |69j . 

Finally, let us mention that approaches exist that treat gravity as simply a spin-2 
field on flat space |114l 1115] . It has been conjectured that one could reconstruct the 
Einstein-Hilbert action in such an approach by considering consistency conditions order 
by order in perturbation theory. This will, of course, be an invalid treatment when 
gravity is strong, and in cosmology. 

2-4- Theorems 

There a number of theorems in General Relativity that are of great importance for 
the structure of the theory itself, as well as for the solutions to the field equations. These 
theorems underpin a lot of the acquired intuition on how gravity should function in 
different environments, and what the resulting phenomenology should be. In alternative 
theories of gravity, however, the theorems of General Relativity often fail, allowing new 
behaviours that would otherwise be impossible. 

Here we briefly recap what we consider to be some of the most important theorems 
of General Relativity. In later sections we will show how these theorems are violated in 
alternative theories, and discuss the consequences of this. 

2.4-1- Lovelock's theorem 

Lovelock's theorem |831l 1832) limits the theories that one can construct from the 
metric tensor alone. To enunciate this theorem, let us begin by assuming that the metric 
tensor is the only field involved in the gravitational action. If the action can be written 
in terms of the metric tensor g^ alone, then we can write 



If this action contains up to second derivatives of g^, then extremising it with respect 
to the metric gives the Euler-Lagrange expression 



and the Euler-Lagrange equation is E^ V (C) — 0. Lovelock's theorem can then be stated 
as the following: 

Theorem 2.1. (Lovelock's Theorem) 

The only possible second-order Euler-Lagrange expression obtainable in a four dimen- 
sional space from a scalar density of the form C — C(g fll ,) is 




(55) 




(56) 




(57) 




(58) 
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where a and A are constants, and R pv and R are the Ricci tensor and scalar curvature, 
respectively. 

This powerful theorem means that if we try to create any gravitational theory in a 
four-dimensional Riemannian space from an action principle involving the metric tensor 
and its derivatives only, then the only field equations that are second order or less are 
Einstein's equations and/or a cosmological constant. This does not, however, imply that 
the Einstein-Hilbert action is the only action constructed from g pl , that results in the 
Einstein equations. In fact, in four dimensions or less one finds that the most general 
such action is 

C = aV^gR - 2X^g + ^ vpX R aP R aPpX + 1 ^g~ (r 2 - AR^R\ + R^ v pX R pX u 



where f3 and 7 are also constants. The third and fourth terms in this expression do not, 
however, contribute to the Eulcr-Lagrange equations as 



e ^ p R? a pRj6p\ 



R 2 -4R a p RP a + R^ pX R»\ 



= (59) 
) = 0, (60) 



where the action of E^ v on any function X is defined as in Eq. (57 1. The first of these 
equations is valid in any number of dimensions, and the second is valid in four dimensions 
only. 

Lovelock's theorem means that to construct metric theories of gravity with field equa- 
tions that differ from those of General Relativity we must do one (or more) of the fol- 
lowing: 

• Consider other fields, beyond (or rather than) the metric tensor. 

• Accept higher than second derivatives of the metric in the field equations. 

• Work in a space with dimensionality different from four. 

• Give up on either rank (2,0) tensor field equations, symmetry of the field equations 
under exchange of indices, or divergence-free field equations. 

• Give up locality. 

The first three of these will be the subject of the next three sections of this report. The 
fourth option requires giving up on deriving field equations from the metric variation of 
an action principle, and will not be considered further here. 

2.4.2. Birkhoff's theorem 

Birkhoff 's theorerrj^] is of great significance for the weak-field limit of General Rela- 
tivity. The theorem states |160j 



This theorem is commonly attributed to Birkhoff, although it was already published two years earlier 
by Jebsen 659 . It is not to be confused with Birkhoff's pointwise ergodic theorem. 
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Theorem 2.2. (Birkhoff's Theorem) 

All spherically symmetric solutions of Einstein's equations in vacuum must be static and 
asymptotically flat (in the absence of A). 

Strictly speaking, there are very few situations in the real Universe in which Birkhoff's 
theorem is of direct applicability: Exact spherical symmetry and true vacuums are rarely, 
if ever, observed. Nevertheless, Birkhoff's theorem is very influential in how we under- 
stand the gravitational field around (approximately) isolated masses. It provides strong 
support for the relativistic extension of our Newtonian intuition that far from such ob- 
jects their gravitational influence should become negligible, or, equivalently, space-time 
should be asymptotically flal|^] We can therefore proceed with some confidence in treat- 
ing the weak-field limit of General Relativity as a perturbation about Minkowski space. 
Birkhoff's theorem also tells us that certain types of gravitational radiation (from a star 
that pulsates in a spherically symmetric fashion, for example) are not possible. 

As we will show below, Birkhoff's theorem does not hold in many alternative theories 
of gravity. We therefore have less justification, aside from our own intuition, in treating 
the weak field limit of these theories as perturbations about Minkowski space. We must 
instead be more careful, as the space-time we perform our expansion around can have 
asymptotic curvature, leading to either time or space-dependence of the background (or 
some combination of the two). What is more, the perturbations themselves may be time- 
dependent, and their form can be sensitive to the type of asymptotic curvature that the 
background exhibits. Behaviours such as these are not expected in General Relativity 
[837] , 

2.4-3. The no-hair theorems 

These theorems are named after the phrase coined by Wheeler that "black holes have 
no hair". The first of these theorems was given by Israel and showed that the only 
static uncharged asymptotically flat black hole solution to Einstein's equations is the 
Schwarzschild solution 650 . He later extended this theorem to include charged objects 
|651j , and Carter extended it to black holes with angular momentum j262J . The theorem 
is therefore often stated today as "the generic final state of gravitational collapse is a 
Kerr-Newman black hole, fully specified by its mass, angular momentum, and charge " 
[1253] . 

Complementary to the black hole no-hair theorems is the no-hair 'theorem' of de Sitter 
space. The claim here is that in the context of General Relativity with a cosmological 
constant all expanding universe solutions should evolve towards de Sitter space. This 
has been shown explicitly by Wald for all Bianchi type models^] |1255j . 

These theorems play an important role in General Relativity and cosmology. Some 
progress has been made in extending them to alternative theories of gravity, but there 
have also been explicit examples of them being violated in particular theories. This will 
be discussed further in subsequent sections. 



8 Of course, in a cosmological setting asymptotic regions are never realised as we will eventually come 
across the other masses in the Universe. 

9 Except type-IX universes with large amounts of spatial curvature. 
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2.5. The Parameterised Post-Newtonian Approach 

This section is a recap of the Parameterised Post-Newtonian (PPN) formalism that 
is widely used by both theoretical and observational gravitational physicists. The idea 
here is to create a construction that encompasses a wide array of different gravitational 
theories, and that contains parameters that can be constrained by observations in a rea- 
sonably straightforward fashion. In this way labour can be saved on both the theoretical 
and observational ends of the spectrum: Observers can apply their results to constrain 
a wide array of theories without having to trawl through the details of the individual 
theories themselves, and theorists can straightforwardly constrain their new theories by 
comparing to the already established bounds on the PPN parameters without having to 
re-calculate individual gravitational phenomena. To date, this approach has been highly 
successful, and in the following sections of this report we will often refer to it. We will 
therefore outline here how the PPN formalism proceeds. For a more detailed explanation 
of the principles and consequences of this formalism the reader is referred to [1274] . 

2.5.1. Parameterised post-Newtonian formalism 

The PPN formalism is a perturbative treatment of weak-field gravity, and therefore 
requires a small parameter to expand in. For this purpose an "order of smallness" is 
defined by 

[/~w 2 ~-~n~0(2), 
P 

where U is the Newtonian potential, v is the 3- velocity of a fluid element, P is the pressure 
of the fluid, p is its rest-mass density and II is the ratio of energy density to rest-mass 
density. Time derivatives are also taken to have an order of smallness associated with 
them, relative to spatial derivatives: 

\d/dx\ { 1 

Here we have chosen to set c = 1. The PPN formalism now proceeds as an expansion in 
this order of smallness. 

For time-like particles coupled to the metric only the equations of motion show that 
the level of approximation required to recover the Newtonian limit is goo to 0(2), with 
no other knowledge of other metric components beyond the background level being nec- 
essary. The post-Newtonian limit for time-like particles, however, requires a knowledge 
of 



goo 


to 


0(4) 


m 


to 


0(3) 


9ij 


to 


0(2). 



Latin letters here are used to denote spatial indices. To obtain the Newtonian limit of null 
particles we only need to know the metric to background order: Light follows straight 
lines, to Newtonian accuracy. The post-Newtonian limit of null particles requires a 
knowledge of goo and gij both to 0(2). 

The way in which the PPN formalism then proceeds is as follows. First one identifies 
the different fields in the theory. All dynamical fields should then be perturbed from 
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their expected background values, and the perturbations assigned an appropriate order 
of smallness each. For theories containing a metric the appropriate expansion is usually 



5oo = -l + h$+h$+0{6) (61) 
g 0i = h$+0(5) (62) 
m = % + ^ 2) +0(4), (63) 

where superscripts in brackets denote the order of smallness of the term. If, for example, 
the theory contains an additional scalar held, then the usual expansion for this quantity 
is 

= 0o + V {2) + <^ (4) + 0(6), (64) 

where (f>o is the constant background value of (j>. Additional vector and tensor gravita- 
tional fields can be specified in a corresponding way. 

The energy-momentum tensor in the PPN formalism is then taken to be that of a 
perfect fluid. To the relevant order, the components of this tensor are given by 

Too = p(l + n + v 2 - h 00 ) + 0(6) (65) 
T Ql = -m + 0(5) (66) 
T iJ =pv i v j +PSi j +0(6). (67) 

Taking these expressions, the held equations for the theory in question, and substituting 
in the perturbed expressions for the dynamical fields in the theory, as prescribed above, 
the field equations can then be solved for order by order in the smallness parameter. 

The first step in such calculations is usually to solve for h$ . With this solution in 
hand, one then proceeds to solve for h!^ and simultaneously, and finally can be 
solved for. If additional fields exist, beyond the metric, then these quantities must also 
be solved for to increasing order of smallness as the calculation proceeds. In finding h\- , 

and h$ one needs to specify a gauge. After such a specification one still, of course, 
has the freedom to make additional gauge transformations of the form x^ — > x^ + 
where ^ is 0(2) or smaller. This freedom should be used at the end of the process to 
transform the metric that has been obtained into the "standard post- Newtonian gauge" . 
This is a gauge in which the spatial part of the metric is diagonal, and terms containing 
time derivatives are removed. Once this has been done then one is in possession of the 
PPN limit of the theory in question. 

We have so far outlined the procedure that one needs to follow in order to gain the 
appropriate form of the metric that couples to matter fields in the weak-field limit. Once 
done, the result can then be compared to the 'PPN test metric' below: 

500 = -l + 2G[/-2/3G 2 [/ 2 -2£G 2 $w + (2 7 + 2 + a3 + /3 1 -2£)G$i 
+2(1 + 3 7 - 2/3 + fh + £)G 2 $ 2 + 2(1 + /3 3 )G$ 3 - {Pi - 2£)GA 
+2(3 7 + 3/3 4 - 2£)G$ 4 

5w - - l -{2, + A 1 + a 1 -a 2 +p 1 -2^)GV l - l -(l + a 2 -fi 1 +2i)GW l 

9ij = (l + 2 7 G[/)%. 
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Here (3, 7, £, Pi, /3 2 , P3, P4, ct\i ^2 an d a 3 are the 'post-Newtonian parameters', U is 
the Newtonian gravitational potential that solves the Newtonian Poisson equation, and 
$1, $2, $3, ^4, -4, and Wj are the 'post-Newtonian gravitational potentials' 
(the precise form of these potentials is given in [T274J). The particular combination of 
parameters before each of these potentials is chosen here so that they have particular 
physical significance, once gravitational phenomena have been computed. 

2.5.2. Parameterised post-Newtonian constraints 

Comparison of the weak field metric of a particular theory with the PPN test metric 
above allows one to read off values for the PPN parameters /3, 7, £, fix, /3 2 , /? 3 , P4, «i, a 2 
and «3 for the theory in question. The test metric has been constructed to include the 
type of potentials that often appear when one modifies gravitjj^] The great utility of 
the PPN formalism is that observers can take the PPN test metric above and constrain 
the parameters without having a particular theory in mind. These constraints can then 
be applied directly to a large number of gravitational theories, without having to work 
out how complicated gravitational phenomena work in each theory individually. 

In General Relativity we have that (3 — 7 = 1 and £ = /?j = /3 2 = = ^4 = ot\ = 
ct-2 = ct% = 0. Other theories will predict other values for these parameters, and we will 
discuss these on a case by case basis in the sections that follow. Observationally, one can 
use the gravitational phenomena discussed in Section [T] to impose the constraints that 
follow. 

As already discussed, observations that involve only null geodesies are sensitive to 
the Newtonian part of the metric, <?q , and the term g\,' only. These two terms involve 
the PPN parameter 7 only. We can now use constraints on the bending of light by the 
Sun to get a constraint on 7. Using the PPN test metric the predicted bending of light 
that one should observe is |1274j 

= 2(1+7)- = to^GH, (68) 
r 2 

where m is the mass of the Sun, r is its radius, and Qqb. is the general relativistic 
prediction. Using the observed value of 9 given in Section [I] then gives |1131j 

7- 1 = (-1.7 ±4.5) x 1CT 4 , (69) 

which is consistent with the general relativistic value of 7 = 1. Similarly, we can use the 
PPN test metric to find that the Shapiro time delay effect is given by |1274j 

M = (1 + 7) Mgr ^ (70) 

where subscript GR again means the value of this quantity as predicted by General 
Relativity. Taking the observed value of At given in Section [T] then gives the even tighter 
constraint |147j 

7- 1 = (2.1 ±2.3) x 1CT 5 , (71) 



10 It is not, however, an exhaustive collection of all possible potentials, and in some theories it is 
occasionally necessary to include additional terms. 
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again consistent with 7 = 1. It can now be clearly seen that the bending of light by the 
Sun, and the Shapiro time delay effect do, in fact, constrain the same aspect of space-time 
geometry. They can therefore be considered as complimentary to each other. 

If we now consider observations of gravitational phenomena that involve time-like 
geodesies then we are able to observe, potentially, all of the post-Newtonian potentials in 
the PPN test metric. This becomes clear from the expression for perihelion precession, 
which now becomes 



6nM 

Alo = 

P 



where M is the total mass of the two bodies involved, fi is their reduced mass, and p 
is the semi-latus rectum of the orbit. The affect of modifying the geometry can be seen 
here to be degenerate with the effect due to the solar quadrupole moment, Ji- Once 
the value of this quantity is known, however, then one is able to gain constraints on 
the above combination of j3, 7, a\ 1 a,^ a% and This can be done for any or all of 
the observations of the perihelion precession of Mercury given in Section [TJ and if we 
take the value of 7 to be that given by Eq. (71), as well a^J 1 " 1 "] ot\ ~ ol^ ~ 03 ~ P2 ~ 



and a reasonable value of J2 ~ 10~ 7 , then this gives constraints on j3 of the order 
/?— 1 ~ O(10~ 3 ) or O(10~ 4 ). However, as already noted, these constraints arc somewhat 
sensitive to a number of assumptions about the orbits of the other planets, as well as the 
solar quadrupole moment. 

The Nordtvedt effect is similarly an observation of time-like geodesies. In this case it 
is convenient to define the 'Nordtvedt parameter' 

10 2 2 1 

m = 4/3 - 7 - 3 - - at + -02 - -ft - -ft, (72) 

which is not to be confused with the equivalence principle violation parameter r\ defined 



in Equation ( 15 ). The observations of Williams, Turyshev and Boggs [1277] then give the 
constraint rj N = (4.4 ±4.5) x 10 -4 , which, if we again take 7 to be given by observations 
of the Shapiro time delay effect with all other PPN parameters being zero, gives us 

0-1 = (1.2 ±1.1) x 10" 4 , (73) 

which is a much cleaner constraint on ft than those which can be derived from observations 
of the perihelion precession of Mercury. 

In 'conservative' theories of gravity it is usually only the PPN parameters ft and 
7 (and sometimes £) that vary from their general relativistic values. These quantities 
are often interpreted as the degree of non-linearity in the gravitational theory, and the 
amount of spatial curvature per unit mass that is produced, respectively. The other 
parameters £, on and fti are usually interpreted as corresponding to preferred location 
effects, preferred frame effect and the violation of conserved quantities. When considering 
theories in which such effects are expected to be absent it is therefore usual to assume 
that these parameters are all zero, and to search instead for constraints on ft and 7. 

Of course, one can subject the £, en and fti parameters to observational scrutiny in a 
number of ways. The table below gives a selection of the tightest constraints currently 
available: 



11 Thcse values will be given some justification shortly. 
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Parameter 


Limit 


Source 




10" a 


Ocean tides [1274] 


ai 


10~ 3 


Lunar laser ranging [928] 




4 x icr 7 


Alignment of Sun's spin axis with ecliptic [987J 


a 3 


4 x 1CT 20 


Pulsar acceleration [1 176] 



Further constraints and discussion on the /3j parameters can be found in |1274j . For more 
details of the observations leading to these constraints on £ and on the reader is referred 
to the source material cited above and |1274j . 

The constraints on the PPN parameters that we have discussed above are all, to 
date, in reasonably good agreement with General Relativity, and it is likely that future 
observations of, for example, the 'double pulsar' [8401 1745] will tighten these constraints 
even further in coming years. This excellent concordance of numerous different physical 
phenomena means that one must reconcile any alterations to General Relativity with 
observations in weak held systems that appear to be narrowing down on a general rel- 
ativistic description. As we will describe in the sections that follow, this places tight 
constraints on a variety of different modified theories of gravity: It must be the case 
that any alternative theories that we consider should reproduce General Relativity in 
the appropriate weak field limit, or at least something very close to it. 

There are a number of mechanisms that have been considered in the literature that 
allow for a general relativistic weak field limit even in theories that are, in general, very 
different from General Relativity. These include the Vainshtein mechanism [1241] which 
occurs when large derivative interactions are present, the Chameleon mechanism of theo- 
ries with non-minimal coupling to scalar fields |689| , as well as the attractor mechanism of 
Damour and Nordtvedt [356J . These different approaches allow, potentially, for theories 
that deviate considerably from General Relativity to exist without disturbing gravita- 
tional physics in the solar system to a large extent. They are thought to be successful in 
a number of different environments, and have sometimes been applied to situations that 
are quite different to the ones in which they were originally conceived. 

As well as successful reproductions of general relativistic behaviour, however, there 
have also been a number cases found in the literature of theories that produce weak field 
gravity that is surprisingly inconsistent with the predictions of General Relativity. Per- 
haps the most famous of these is the van Dam-Veltman-Zakharov (vDVZ) discontinuity 
that was originally found in the context of Pauli-Fierz gravity [12431 11296| (a theory with 
one dynamical metric, and one non-dynamical a priori specified metric). Here the gravi- 
ton acquires a mass through the introduction of terms into the gravitational Lagrangian 
that, in the weak field limit, look like mass terms for the perturbations hu V around 
Minkowski space, i.e. like mth^h^. Naively one might then expect in the limit m — > 0, 
when the graviton becomes massless, that the zero mass theory of General Relativity 
should be recovered. This is, however, not the case. Instead one finds from the study of 
linear perturbations around Minkowski space that 7 — > 1/2, which can be seen from the 
constraints above to be in strong disagreement with a number of different observations, 
including light bending and time delay effects. The general relativistic limit in this case 
is therefore a singular one, and any finite but non-zero graviton mass, no matter how 
small, appears to give results that are incompatible with observations. Similar results 
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have also been found in some theories of gravity constructed from more general functions 
of the Ricci curvature than the Einstein-Hilbert action |294j , and are expected in other 
theories as well. In these cases one must either abandon the theory as being incompatible 
with observations, or show that the treatment being applied is unsatisfactory because, 
for example, one of the mechanisms discussed previously should be applied. 

Issues such as those just discussed can make the study of weak field gravity in modified 
theories a more complicated subject than it is in General Relativity. One must be careful 
to make sure that the treatments being applied are justifiable, that the limits of the theory 
take the expected form (rather than being singular) , and that non-linear mechanisms and 
non-perturbative effects are being fully taken into account. How this should be done for 
specific modified theories of gravity will be the subject of subsequent sections. In some 
cases it is still an active area of research. 



2. 6. Cosmology 

We now turn to cosmology, which forms a major part of this review. In this sec- 
tion we first describe cosmology from the point of view of General Relativity, including 
Friedmann-Lemaitre- Robertson- Walker (FLRW) solutions, cosmic distance measures and 
cosmological perturbation theory. We then consider the observational evidence that has 
led to the rise of the "Dark Sector" , thus arriving at the so-called ACDM 'concordance 
model'. We end this section with a short discussion of the successes of ACDM, its pre- 
dictions and potential shortcomings. 

2.6.1. The Friedmann-Lemaitre- Robertson- Walker solutions 

The Robertson- Walker metric in the synchronous coordinate system is 

ds 2 = -dt 2 + a 2 {t) qij dx l dx\ (74) 

where <jy is a maximally symmetric 3-metric of Gaussian curvature n. In a spherically 
symmetric coordinate system this can be written 



ds 2 = -dt 2 + a 



dr 2 

' ~ 2 dn 



1 — nr 



2 



where n is a real constant. If k = 0, the hyper-surfaces of constant t are flat, if k > 
they are positively curved, and if n < they are negatively curved. The function a(t) is 
called the scale factor, and we assume it to be normalised to unity today. 
The Hubble parameter is defined as 

a dt 

with Ho = H(a — 1) being the Hubble constant, i.e. the value of the Hubble parameter 
at the present time. 

The dynamics of the scale factor is given by the Friedmann equation 

3# 2 =87rG^p* (76) 
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where pi are the energy densities of all possible fluids, including photons p 7 , neutrinos 
p v (possibly with mass m„), pressureless matter p m , and spatial curvature p K . We may 
also define the relative densities 

a = (77) 

p T 



3H- 1 
SttG 



J2iPi- The Friedmann equation then 



where px is the total energy density px 
becomes the constraint J2i ^ = 1. 

If each fluid is uncoupled then energy-momentum conservation gives 



3H(l + Wi) Pi = 0, (78) 
For the known forms of matter 



wp. 



where w is the equation of state, defined by P 
W-y = |, w m = 0, W K = — |, and w u is in the range [0, |] . We may solve Eq. ( 78 ) for a few 
cases of interest, and then determine the dynamics of the scale factor. For radiation we 
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obtain p r = p 0r a~ 4 , for pressureless matter p m = p 0m a~ 3 , for curvature p K — 
and for a cosmological constant p\ = A. 

A general analytic solution in the case where all the above fluids are present is im- 
possible. However, analytic solutions can be found in certain special cases. If a single 
fluid is present and w is constant then et 3 ( 1+K ')/ 2 = 3 ( 1 + t ") Hot, provided w ^ — I. For 

the case of radiation we get a = ^/2Hat, and for pressureless matter a — (^y^) 2 ^ 3 ■ The 
case of a cosmological constant is special: One obtains a = e Hat , the de Sitter solution, 
in which space (in this coordinate system) is exponentially expanding. 

In many cases of interest it is convenient to use a different time coordinate, the 
conformal time, r, defined by dt = adr. In a radiation dominated universe we then have 
a(r) = Hqt, in a matter dominated universe a — j(Hqt) 2 , and for the de Sitter universe 
a = }j a ^ _ T ) ; where T m is the value of the conformal time at a — > oo. In a universe 
filled with both radiation and matter we get a = \/fl 0r H T + ^^(Hqt) 2 . A summary 
of these solutions is shown in Table [2j 



Matter type 


a(t) 


a(r) 


radiation 
dust 

radiation & dust 
A 


a = (2H t)^ 2 

a=(|iJ *) 2/3 
complicated 

a = e Hot 


a = Hqt 
a=\(H T) 2 

a = V^o~r H q t + ^ (H r) 2 

a ~ Hoir^-r) 



Tabic 2: A summary of particular solutions to the Friedmann equation. 



2.6.2. Cosmological distances 

Given a Friedmann universe obeying Einstein's field equations, it is useful to define 
observables that characterise the background evolution. Distances play an important role 
if we are to map out its behaviour (see |612) for a more detailed explanation). Hubble's 
law v — H Q d allows us to define a Hubble time, t^ = ^ = 9.78 x 10 9 h^ 1 yr and the 
Hubble distance, Dh — = 3000 hr 1 Mpc. We can also integrate along a light ray to 
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get the comoving distance: 



f t0 cdt' 

D c = c 



a{t>) 

From — k = £l K /D 2 H , and performing the radial integral (assuming the observer is at 
r = 0), we have 



Jfagn3h- 1 [y/n;D M /D B ] for n K > 
D c = I = = { D m for n K = 

^ - Kr2 -^sm-^^QDu/DH] forfi K <0, 



d m dr , Vn K 







where the proper motion distance (also known as the transverse comoving distance) is 
Dm- This can be rewritten as 

D f= smhWrUD c /D H } for Q K > 

for K = 




sin[^O^Dc/£>H] for tt K < 0. 

It is then possible to find an expression for the angular diameter distance: 

D M 



D a = 



1 



Hence, if we know the proper size of an object and its redshift we can work out, for 
a given universe, the angular diameter distance, Da- If wc measure the brightness or 
luminosity of an object, we know that the flux of that object at a distance Dl is given by 
F = 4 J" D -2 , where -Dl is aptly known as the luminosity distance and is related to other 
distances through: 

D L = (1 + z)D u = (1 + zfD A . 

This relation is a consequence of Etherington's theorem [474], and holds in any metric 
theory of gravity, irrespective of the field equations. It is however violated if the photon 
number is not conserved (e.g. due to photon-axion mixing), or if photons are extinguished 
due to the presence of dust. It turns out that in astronomy one often works with a 
logarithmic scale, i.e. with magnitudes. One can then define the distance modulus: 



DM = 5 log 



Dl 
10 pc 



which can be measured from the apparent magnitude, m, (related to the flux at the 
observer), and the absolute magnitude, M, (what the magnitude would be if the observer 
was at 10 pc from the source) through m = M + DM. 

Finally, let us consider Hubble's law. Take two objects that are a distance d apart, 
and Taylor expand the scale factor today to find 

a{t) = a(t ) + a(t Q )[t - t ] + ^a(t Q )[t - t a } 2 + ■■■ . 
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On small scales the distance to the emitter is roughly related to the time of emission, t, 
by d ~ c(t — t). We can then rewrite the above expression as 

where qo = —aa/d 2 \ t= t is the deceleration parameter. On small scales and at small 
redshifts we then have Hubble's law, cz = Hod. 

To constrain the background evolution it is necessary to have good distance mea- 
surements. So, for example, with measurement of supernovae light curves at different 
redshifts it is, in principle, possible to measure Dl(z). Alternatively one might try to 
measure Da(z) by observing known length scales in the universe. This has been done 
spectacularly well with the sound horizon of the cosmic microwave background at redshift 
z ~ 1100. More tentatively, there is a constraint on a combination of Da{z) and H(z) 
using the imprint of acoustic oscillations of baryons on galaxy clustering at moderate to 
low redshifts, z ~ 0.1 — 0.3. 

2.6.3. Perturbation theory 

We now turn to perturbation theory, which is an indispensable tool for making predic- 
tions for a variety of cosmological observations. For extensive treatments of cosmological 
perturbation theory the reader is referred to [5U1 17171 14691 1917] . Here we shall only 
consider scalar fluctuations, for which the perturbed FLRW metric can be written 

ds 2 = a 2 j-(l - 2~)dr 2 - 2(V i l3)dTdx i + 

where = VjVj — §<Jr;A is a trace-less spatial derivative operator. We note that V* is 
the covariant derivative compatible with the 3- metric q^j . Perfect fluids with shear have 
energy-momentum tensors that can be written as 

Tpv = (p + P)Ufj,u v + Pg^u + £/«,, (80) 

where p is the energy density, P is the pressure, the 4- velocity of the fluid (normalised 
to u^Ufj, = —1), and is the anisotropic stress tensor which obeys u^T,^ = E^ 1 = 0. 
In a homogeneous and isotropic space £^„ = 0, and it M is aligned with the time direction 
such that in the coordinate system used above it has components = (a, 0). For first 
order scalar perturbations we can parameterise T* v as 

T° = -p6 (81) 

T°i = -{ P + P)^ i e (82) 

T% = ( P + P)V\8-(3) (83) 

T) = 5PS i j + (p + P)D i j E, (84) 

while the fluid velocity is = a(l — H, Vj6*). Here SP is the pressure perturbation, and 
S the scalar anisotropic stress. 

For any variable X, its perturbation <5X is not necessarily an observable quantity, 
and may depend on a gauge. In particular, one can always define a new perturbation 
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1 + 3X ) Qij+Diju 



dx t dx 3 



(79) 



<5X' = i5X+£^X through the Lie derivative acting on the background tensor X through a 
vector field £ M . The perturbations <5X are thus in general gauge-dependentp] For scalar 
perturbations we can write £ M = <z(— £, Vj^), and then find how our variables transform 
under gauge transformations using the Lie derivative. All of them, apart from E, are 
gauge-dependent, with transformations given by 

X -> X + \[$HZ + 2AV>] i/->i/ + Jty 

5 -> S - |(1 + 6>^6>+i£ 



where W. = ^. 

Given our set of perturbation variables, two linear combinations of them can be 
removed/^] (set to zero) . Popular gauges are 

• Newtonian gauge: v = fi = 0. The remaining metric perturbations give rise to the 
Newtonian potentials $ = — 1% and 'J = — S. 

• Synchronous gauge: S = /3 = (this does not completely fix the gauge). The 
remaining metric perturbations are related to the Ma-Bertschinger [841] variables 
as X = h an d —k 2 v = h + 6r/. 

• Comoving gauge: 6 = v = 0. Strictly speaking there is a multitude of comoving 
gauges depending on which velocity 9 is set to zero. Thus we may speak of a 
"baryon comoving gauge" if 9^ = 0, a "photon comoving gauge" if 9 1 — 0, the total 

matter comoving gauge if 9t — ^ j ^ P ( x+P p' > ^ x = 0, etc. 

• Uniform density gauge: 5 = v = 0. Once again there is a multitude of uniform 
density gauges depending on which density fluctuation is set to zero, as in the 
comoving gauges above. 

• Spatially flat gauge: x — v = fl- 
it is possible to find combinations of perturbation variables that are gauge invariant, 
but note that there are an infinite number of them as any linear combination of gauge- 
invariant variables is also gauge-invariant. Two popular gauge-invariant metric variables 
are the Bardeen potentials $ and "J: 

$ = _i( x _Ai/) + ~tt(i/ / + 20) > ( 85 ) 

and 

y=-E-^(v" + 2(3')-~H(v' + 2p). (86) 



12 The Stewart-Walker lemma |1 188 states that the only gauge-invariant perturbed tensors are those 
that have background values that are either zero or a constant multiple of the identity matrix. 

13 One has to be careful and not over constrain the gauge by removing two combinations that transform 
with the same gauge variable, e.g. S and 8 both transform with £ and therefore cannot be set to zero 
simult aneously. 
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The Newtonian gauge is special in this case as $ = $ and & = 1 J r . From now on we 
will refer to $ and ^ without a "hat" as the Newtonian gauge potentials. The Einstein 
equations in the Newtonian gauge give 



2(A + 3k)$-6-H($'+-H*) = 8nGa 2 M (87) 

i 

2($'+^*) = 8irGa 2 J2(Pi + p i) d i 



+ W + 2H& + ym 1 +V. 2 + ^A J * - (^A + k)$ = 47rGa 2 SPi (89) 

i 

and 

$ - * = 8vrGa 2 ^ (p, + P t ) (90) 



Combining Eqs. mfl and (88) we can find $ in terms of the matter variables as 



2(A + 3k)$ = m 2 J2 n i i S i + m ( l + Wi)0i] (91) 



while \& is then obtained using Eq. ( 90 ) 



Finally, all scalar modes can be decomposed in terms of a complete set of eigen-modes 
of the Laplace-Beltrami operator. For example, a variable A can be decomposed as 

A(x\t) = J d 3 kY{x 3 ,k k ) A(ki,t), 

where the eigen-modes, Y{x^ , kk), obey (A + fc 2 ) Y = . In the special case of topolog- 
ically trivial and spatially flat hyper-surfaces of constant t, we simply have Y = e %h i x , 
The integral transform above is then a Fourier transform. The value of k depends on the 
geometry and topology of the spatial hyper-surfaces: In the case of trivial topology k 
takes values k = yjk 2 — n, where fc* is continuous and obeys fc* > for zero or negative 
spatial curvature, while fc* = N^fn for positive spatial curvature, where N > 3 is an 
integer. 



2.6.4- Gravitational potentials and observations 

One of the main sources of information in cosmology is through the observation of 
perturbations about a Friedmann background. Such perturbations can be probed through 
their effects on the dynamics of particles and light, which we will now describe (see |1028j 
for further details): 

• Density fluctuations: Fluctuations in the matter density field, <5(x), will reflect 
various properties of the cosmological model. The simplest approach is to assume 
that <5(x) is a multivariate Gaussian random field that is entirely described by the 
power spectrum, -P(fc), defined by 



(\5 k \ 2 ) = P(k), 
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(92) 



where 5^ is the Fourier transform of <5(x). The shape of the power spectrum 
contains a wealth of information: The amplitude of clustering as a function of scale, 
its redshift dependence, how its shape on small scales is distorted by small scale 
velocities (known as redshift space distortions) , and acoustic features imprinted by 
the baryons from pre-recombination (known as baryon acoustic oscillations) can 
all be used as distance indicators. The power spectrum can be estimated from 
surveys of galaxies or clusters of galaxies, the clustering properties of which can be 
directly related to the amplitude of fluctuations in the density field (under certain 
assumptions of how galaxies (or clusters) trace the density field (known as bias)). 

• Peculiar velocities: The motion of galaxies relative to the Hubble flow, v % , is de- 
scribed by the non-relativistic geodesic equation given above. In the linear regime, 
the peculiar velocity can be related directly to the density field via the gravitational 
potential: 

where / = dlnS/dhia and we have assumed the general relativistic result $ = 
^. Peculiar velocities will be observable through their effects on the redshift of 
objects, either in redshift galaxy surveys (through their distortion of P(k)), or 
when supplemented with independent distance measurements of each object (using 
the Tully-Fisher relation or supernova light curves) in peculiar velocity surveys. 

• Anisotropics in the Cosmic Microwave Background (CMB): The CMB will be sen- 
sitive to density fluctuations, peculiar velocities, and the gravitational potentials. 
It is usual to characterise anisotropies in the CMB in terms of ST l n ' , the dimen- 
sionless deviations of the black-body temperature of the Universe in a direction 
given by the unit vector, h. We can expand 

5T(n) ^ 

j, = 2 y a lm,¥tm{n), 

where we have spherical harmonics, Yi m (n), and define the angular power spectrum 
^ = 2I+1 J2 m (\ a £m\ 2 } ■ Like P(k), the CfS contain a wealth of information about 
the cosmological model. It is now instructive to delve slightly further into the form 
of™. We can schematically split CMB anisotropies into three cosmological 
contributions, 

ST(h) _ 5T(n) I ST(n) , 5T(n) , 

Ji — J> IlS ' J< llSW ' J> I SEC 

where the first term encompasses all effects from the surface of last scattering, the 
second term (the Integrated Sachs Wolfe effect) is due to integrated effects along 
the line of sight, and the last term encompasses secondary effects such as weak 
lcnsing of the CMB, the Sunyaev-Zel'dovich effect and other such contributions. 
Let us focus on the effect of the gravitational potentials, the consequence of which 
we can see through the geodesic equation for light rays given above [150J. The 
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accumulated redshift of a beam of light along the line of sight is given by 

E f z 
(1 + z) = = 1 + fcm - fobs / ($ + 4f)(dr/dz) dz + higher order terms. 

"em JO 

where factors such as the integrated visibility function have been ignored for sim- 
plicity. The first term is the Sachs- Wolfe effect and, in the case of the CMB, will 
give a redshift to the photons as they climb out of potential wells at the surface 
of last scattering. The second term is the ISW effect, and depends on the time 
dependence of the gravitational potentials along the line of sight, as advertised 
above. 

• Weak lensing: Lensing arise when photon light rays are deflected due to the 
gravitational potentials along the line of sight. The deflection angle is given by 
59 = — Vj_(<i> + ^>)dr, and allows us to relate the true position, ^truo, to the de- 
flected position, #dcf, via 6>t rU c = &dd — ^f^89, where tl i r Ls) is the distance to 
the lens (between the lens and the source ]. In practise we probe the gradient of 
the deflection through the inverse magnification matrix: 

M" 1 = = I + r ^ViVi($ + *)dr. 

de dci Jo r s 

This two by two matrix is parameterised by the convergence, K, and shear param- 
eters 7i and 72. In the case of small deflections this gives 

(93) 

M=( 1 + K + ^ . J ) ■ 04) 
V 72 1 + A — 71 J 

This information can be extracted from imaging surveys of distant galaxies. The 
galaxy shapes (or ellipticities) will be distorted by the intervening gravitational 
potentials. These distortions will induce correlations between the galaxy shapes 
that will reflect the underlying cosmology. Lensing will, of course, also affect the 
CMB photons as they pass through potential wells. 

2.6.5. The evidence for the ACDM model 

There is currently a consensus that in an FLRW Universe that is governed by Ein- 
stein's field equations, roughly 95% of the overall energy density must be 'dark' in order 
to be compatible with observations. The current best fit model claims that about 25% 
of this dark material is in the form of a non-relativistic, non-interacting form of matter 
called dark matter, and that the remaining 70% is in the form of a non-clustering form 
of energy density with a negative equation of state known as dark energy. 

The broad case for Cold Dark Matter (CDM) is as follows [THE]: 

• The rotation curves of galaxies tend to flatten out at large radii. This flattening 
can be explained if the baryonic part of the galaxy resides in a halo of dark matter 
with a density profile that falls of as 1/r 2 . 
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• Clusters of galaxies appear to have deeper potential wells than would be inferred 
from baryonic matter. This is manifest in the motions of galaxies, as well as 
the X-ray temperature of gas, and weak lensing measurements of the integrated 
gravitational potentials. Dark matter halos surrounding clusters explain all these 
observations. 

• Diffusion damping during recombination is expected to wipe out all small-scale 
structure in baryons, preventing the formation of galaxies at late times. Dark 
matter, however, can sustain structure during the damping regime, and will seed 
the formation of galaxies. 

The case for dark energy has been around since the early 1980s. After the proposal of 
the original models of inflation, the idea that the Universe should have Euclidean spatial 
geometry became ever more entrenched in the standard lore. Given that baryons made 
up a small fraction of the total energy budget, and that dark matter makes up about 
25%, there was clearly a shortfall of pressureless matter at late times. Furthermore, 
estimates of the ages of globular clusters of around 12-14 billion years were incompatible 
with a flat, matter dominated universe [1228 . 

There was also tentative evidence from large-scale clustering that a flat, cold dark 
matter dominated Universe could not explain some of the observations. Most notably, 
an analysis of the APM galaxy catalogue in |463j seemed to show that a Universe with 
a cosmological constant might explain the amount of galaxy clustering on a wide range 
of scales. Now, with the advent of what has been dubbed "precision cosmology" in the 
late 1990s, the evidence for dark energy has become even more compelling. In particular, 
the following results make a strong case for presence of an energy density with negative 
equation of state: 

• Measurements of the luminosity distance of type la supernovae are consistent with 
a universe with a cosmological constant, and inconsistent with a flat, matter domi- 
nated universe or an open universe [T034, 1063 . The latest results seem to constrain 
the equation of state, w = P/p~ -1.068±£ggj [5501 1X190] . 

• Measurements of the CMB anisotropics from large to small scales [7321 1442] , com- 
bined with measurements of galaxy clustering from the Sloan Digital Sky Sur- 
vey (SDSS) [T058] . greatly favour a model with D, A = 0.725 ± 0.016 and w = 
-1.10 ±0.14. 

• The cross correlation between the ISW effect from the CMB and a variety of surveys 
of large-scale structure favour w = — l.Ollo'lo) a ^ around 4cr [5341 1611] , 

• The number density of clusters of galaxies as a function of redshift disfavour a flat, 
matter dominated universe. The presence of massive clusters at high redshift point 
accelerating expansion out to redshift z ~ 2 [52"] . 

Although each individual observation may be subject to a variety of interpretations, and 
different systematic effects, the overall concordance is remarkable. Indeed, the model 
that best fits these observations is now known as the concordance model, or ACDM. 
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2.6.6. Shortcomings of the KCDM model 

Perhaps the most serious problem with ACDM is the cosmological constant problem: 
That the observed value of A is around 120 orders of magnitude smaller than the naive 
expectation that it should be of the Planck Mass, Mpj. Super-Symmetric (SUSY) theories 
can lower this expectation to that of the SUSY breaking scale, but this still required a 
bare Ao to cancel the vacuum energy coming from the SUSY symmetry breaking scale 
to about 60 decimal places. One could consider arguing that some unknown physics at 
high energies may provide a mechanism for achieving this level of fine-tuning, but this 
seems unlikely as the problem already manifests itself at low energies. 

Now, suppose that we want to describe all physics up to scales just above the electron 
mass. Then the contribution to the vacuum energy A will include a bare term Ai , a term 
coming from the electron and a term coming from the neutrino. This is schematically 
given by 

A = Ai + c e m A e + c v m v . . . , 

where c e and c„ are coefficients. If we now lower the energy below the electron mass, 
and integrate out the electron, we instead have 

A = A + c v m% . . . , 

for a new bare term A . To get the same observable vacuum energy, A, we must now 
have that Ai and Ao cancel to 32 decimal places. 

It may be thought that there could exist some mechanism that relaxes the effective 
cosmological const antp*] to zero dynamically, but Weinberg |1265j has shown that this is 
impossible. Suppose that there is a set of N scalars, (j) A , that are responsible for driving 
the effective A to zero. These scalars will contribute an effective potential, V(4> A ), to 
the cosmological constant. If we are to approach a global Minkowski metric at these 
energy levels, then V(4> A ) must cancel the other contributions to A to high accuracy as 
the fields settle to the minimum. However, this is hardly a readjustment mechanism: 
If the cosmological constant changes slightly, then the mechanism fails. This proof as- 
sumes Poincare invariance in the scalar sector, which could, however, be considered an 



unnecessary assumption (see Horndcski's theory in Section 3.1.3). 

The present value of A, as implied by cosmological observations, has another potential 
problem associated with it: It has an energy density of the same order of magnitude as 
the average matter density in the Universe today, 

Pa|o=1 ~ Pm\a—1- 

These two quantities scale with the size of the Universe in very different ways, and so 
their similarity at the present time appears naively to be somewhat of a coincidence. 
Hence, this problem is sometimes referred to as the coincidence problem. 

Aside from the problems of the cosmological constant, there are some problems that 
plague dark matter as well. The first is another coincidence problem: Why is the dark 
matter energy density so close to the baryon energy density? This is actually worse 
than it might seem. Baryons are produced non-thermally, out of equilibrium. CDM is 



By effective cosmological constant we mean the Ricci curvature of the vacuum. 
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usually thought to be produced thermally, as weak interaction cross-sections naturally 
give rise to the right dark matter abundance via thermal production. But how can two 
components that have very different production mechanisms have very similar energy 
densitiesp]? Solutions to this puzzle have been proposed [SJ EH1 115051 EMI EH ES3 
12101 587, 5861 I876[ [27] but they typically require additional particles to those that form 
the dark matter, and there is as yet no well accepted mechanism. 

Other problems with dark matter are observational, and we will discuss them only 
briefly. The density profile of CDM, as determined from N-body simulations, is inferred 
to be cuspy. For example the Navarro- Frenk- White (NFW) profile [946] gives pcdm oc - 
close to the centre of a halo. Other simulations give similar results: pcdm <x r~ a with 
a ~ 0.7 — 1.5. Galaxies, however, are observed to have cores such that p flattens out 
at the centre. This is the cusp problem [366 and proposed solutions within the CDM 
paradigm include self-interacting dark matter |1174j , fuzzy dark matter [628] , or various 
feedback processes that expel dark matter. Note that simulations do not have enough 
resolution to probe the small scales where the problem manifests itself, but rely instead 
on extrapolations. However, simulations with increasingly smaller resolutions (although 
still above the probed scales) have not indicated any kind of alleviation to the cusp 
problem. 

Another problem is that of missing satellites 867] I211j . The CDM paradigm predicts 
a rich sub-structure within the main galactic halo that should lead to numerous dwarf 
galaxies orbiting the main galaxy. Indeed, simulations indicate that about 500 satellite 
galaxies should be orbiting the Milky way |909j . On the contrary, however, only about 
30 such dwarfs have been observed. A possible resolution within the CDM paradigm is 
that most of these galaxies are dark galaxies, i.e. have very little or no stars in them, 
and are instead completely dominated by dark matter [1146] . 

A third problem is the tight correlation between dark matter and baryons in galaxies 
that manifests itself in a universal acceleration scale, ao ~ 1.2 x 10~ 10 rn s~ 2 [8951 11094] - 
the Tully-Fisher relation [57911875] . and the Faber- Jackson relation [T09"2"] . Within the 
CDM paradigm, such correlations are not expected to be present, as baryons should not 
know how the dark matter behaves. For further apparent discrepancies between ACDM 
and small scale observations the reader is referred to |756j . 

On cluster scales and larger, the ACDM model can boast of success coming from a 
host of observations: Strong and weak lensing of clusters, X-ray observations of clusters, 
the CMB angular power spectrum, the matter power spectrum, P(k), and supernova 
data. Yet there are a few cases of interesting discrepancies. The collision velocity of 
the bullet cluster |318] may be so large that the probability of it occurring in a ACDM 
scenario is at best ~ 10 -9 [780]. In [510] , however, the opposite conclusion is reached, so 
this appears far from settled. Cosmological voids seem to be more empty of galaxies than 
expected, as has been championed by Peebles [1030 . The CMB angular power spectrum 
has a lack of large-scale power above 60° [1172] (although the statistical significance of 
this is debatable, due to cosmic variance). Certain violations of statistical isotropy or 
other anomalies on large scales in the CMB have also been reported [5911 13371 11294] . It 
remains to be seen whether these are really problems with ACDM, if they are due to 



15 There are also non-thermal candidates for dark matter, e.g. axions, but this does not change the 
argument. 
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systematic effects, or if they are statistical flukes. These difficulties do, however, provide 
some motivation for looking at alternatives to ACDM. 
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3. Alternative Theories of Gravity with Extra Fields 

In General Relativity the gravitational force is mediated by a single rank-2 tensor 
field, or a massless spin-2 particle in the quantum field theory picture. While there 
is good reason to couple matter fields to gravity in this way, there is less reason to 
think that the field equations of gravity should not contain other fields, and one is in 
general free to speculate on the existence of such additional fields in the gravitational 
sector. The simplest scenario that one could consider in this context is the addition of 
an extra scalar field, but one might also choose to consider extra vectors, tensors, or 
even higher rank fields (5111 11247) . Of course, the effect of such additional fields needs 
to be suppressed at scales where General Relativity has been well tested, such as in 
the lab or solar system. This is usually achieved making couplings very weak, although 
novel screening mechanisms such as the chameleon mechanism [689 , 688 and Vainshtcin 
mechanism |1241| have also been explored. 

This section represents an overview of four-dimensional gravity theories with extra 
fields, focusing on additional scalars, vectors and tensors. We note that some theories 
in other sections of this review can also be considered as theories with extra fields (e.g. 
f(R) gravity, galileons, and ghost condensates). The reader is referred to later sections 
for details of this. 

3.1. Scalar-Tensor Theories 

The scalar-tensor theories of gravity are some of the most established and well studied 
alternative theories of gravity that exist in the literature. They are often used as the 
prototypical way in which deviations from General Relativity are modelled, and are of 
particular interest as the relatively simple structure of their field equations allow exact 
analytic solutions to be found in a number of physically interesting situations. Scalar- 
tensor theories arise naturally as the dimensionally reduced effective theories of higher 
dimensional theories, such as Kaluza-Klein and string models. They are also often used 
as simple ways to self-consistently model possible variations in Newton's constant, G. 

3.1.1. Action, field equations, and conformal transformations 

A general form of the scalar-tensor theory can be derived from the Lagrangian density 

PH MB 

c = iL^ j[f{(t))R ~ -9^) v ^ v ^ - 2A W] + K4>)9v), (95) 

where /, g, h and A are arbitrary functions of the scalar field <fi an d C m is the Lagrangian 
density of the matter fields ^ . The function h(<f>) can be absorbed into the metric by a 
conformal transformation of the form [422] 

h{4>)9iiu -> g^u- (96) 

The conformal frame picked out by this choice is one in which there is no direct interaction 
between the scalar field and matter fields, and is usually referred to as the Jordan frame. 
As discussed in previous sections, test-particles in this conformal frame follow geodesies 
of the metric to which they are coupled, and the weak equivalence principle is satisfied 
for massless test particles. The effect of this transformation on the remainder of the 

49 



Lagrangian can then be absorbed into redefinitions of the as yet unspecified functions /, 
g and A. 

By a redefinition of the scalar field (j> we can now set /(</>) — > <f>, without loss of 



generality. The Lagrangian density ( 95 ) can then be written as 



C 



1 

16tt 



(f>R - ^V^V^ - 2A(0) 



(97) 



where to(4>) is an arbitrary function, often referred to as the 'coupling parameter', and 
A is a ^-dependent generalisation of the cosmological constant. This theory reduces 
to the well known Brans-Dicke theory 184 in the limit ui — > constant and A — s- 0, 
and approaches General Relativity with a cosmological constant in the limit uj — > oo, 
<J /lu 2 —> and A — > constant. 



The variation of the action derived from integrating (97 1 over all space, with respect 
to g^ v , gives the field equations 



□0+^(V0) 2 + A 



(98) 



Now, as well as the metric tensor g M „, these theories also contain the dynamical scalar 
field </>, and so we must vary the action derived from Eq. (97) with respect to this 



additional degree of freedom. After eliminating R with the trace of (98), this yields 



(2uj + 3)D0 + w'(V0) 2 + 4A - 2cj)A' = 8irT. 



(99) 



where primes here denote differentiation with respect to (p. These arc the field equations 
of the scalar-tensor theories of gravity. 

It is well known that these theories admit the very useful property of being 'confor- 
mally equivalent' to General Relativity. By this we mean that under a transformation 
of the metric that alters scales, but not angles, one can find a new metric that obeys 
the Einstein equation, with the scalar contributing as an ordinary matter field. This 
does not, however, mean that scalar-tensor theories are the same as General Relativity, 
as the metric that couples to matter fields must also transform. The theory that is re- 
covered after conformally transforming is one in which the metric obeys a set of fields 
equations similar to Einstein's, but with an unusual matter content that does not follow 
geodesies of the new metric (with the exception of radiation fields, or null geodesies, 
which are themselves conformally invariant). This property of scalar-tensor theories can 
sometimes allow their field equations to be manipulated into more familiar forms, that 
allow solutions to be found more readily. 

To be explicit, a conformal transformation of the metric g^ v into g^ u can be written 



(100) 



where T(x) is an arbitrary function of the space-time coordinates x^. The line-element 
is then correspondingly transformed as ds 2 = e 2T ^ds 2 , and the square root of the 
determinant of the metric as y/—g = e ir yj—g (in four dimensions). After performing 
such a transformation we can use the term 'conformal frame' to distinguish the new, 
rescaled metric from the original. 



50 



Among the infinite possible conformal frames we can then identify two which have 
particular significance: The Jordan frame and the Einstein frame. The Jordan frame is 
the one in which the energy-momentum tensor is covariantly conserved and in which test- 
particles follow geodesies of the metric. This is the frame picked out by the transformation 
(96), and is the one in which scalar-tensor theories are most usually formulated. The 



Einstein frame is the conformal frame in which the field equations of the theory take the 
form of the Einstein equations with the scalar contributing as an ordinary scalar field, 
as discussed above. 



Under the transformation ( 100 ) it can be shown that the Ricci tensor and Ricci scalar 
transform as 



R P u = R„ v - 2V M v,r + 2v AI rv„r - (2v Q rv Q r + Or) ^ 



e 2T R = R- 6V M rVT - 6DF, 



(101) 
(102) 



while the d'Alembertian transforms as e 2T D(j) = □0 + 2V M rV Al 0. Here, over-bars on op- 
erators or indices denote that they are defined using the metric g^. Under these trans- 
formations we will now show how the scalar-tensor theories defined by the Lagrangian 



(97), in the Jordan frame, can all be transformed into the Einstein frame. 



conformal transformation ( 100 ) becomes 



First, consider the term in Eq. (97) containing the Ricci scalar, which under the 



1 

16V* 



(103) 



The non-minimal coupling to the Ricci scalar can now be removed by making the choice 
of conformal factor e 2r = _1 such that g^ v — g^/4). This defines the conformal trans- 
formation between Jordan and Einstein frames in the scalar-tensor theories. Applying 



the transformation (100) to the rest of Eq. (97) then gives 
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167T 1 



(104) 



-g (R - 2(3 + 2w)V /i rVT - 2e 4r A) + £,„(*, e 2r g^). 
Now, by making the definitions -\/47r/(3 + 2w) = dT/d'ijj and 8irV(ip) = e 4r A, for the 



scalar -0 and the function U(t/0, we can write the transformed Lagrangian (104) as 
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iv^V^ + U(V) j - ■£,„(*•' -'//,„. i. 



(105) 



In the absence of any matter fields the scalar-tensor theories can now be clearly seen to 
be conformally related to Einstein's theory in the presence of a scalar field in a potential. 
This potential disappears when A = 

In the Brans-Dicke theory [184 the coupling parameter ui is constant, and the scalar 
fields <f) and ip are therefore related by 



In < 



16tt 



(3 + 2u) 



For more general theories with lu = oj((j>) the definition of i[> must be integrated to obtain 
a relation between <j> and ip. By extremising the action ( 105 ) with respect to g^ v and i\) 
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we get the Einstein frame field equations 



G>„ = 87T 



-v Q vv> + v ) fa 



and 



Dtp 



dV 

dip 



= — v 47raT 



(106) 



(107) 



where a 2 = 3 + 2aj and where we have defined the energy-momentum tensor T^j, with 
respect to g p „ so that = e 6r T^. It can now be explicitly seen that while the Jordan 
frame energy-momentum tensor is covariantly conserved, V M T M!/ = 0, its counterpart in 
the Einstein frame is not, V = V^raTV^ip. 

3.1.2. Brans- Dicke theory 



The Brans-Dickc theory is given by the Lagrangian density (97 1 with ui =constant, 
and A = [184J. The behaviour of this theory in the vicinity of isolated masses is well 
understood, and in the case of static and spherical symmetry can be solved exactly by 
the line-element |185j 



ds z 



,2<< 



e ^(dr 2 +r 2 dtf) 



where a = a(r) and j3 = (3(r) are given by one of the following four solutions: 
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Here we have defined A 2 = (C* + l) 2 - C(l-uC/2) > in solution I, and A 2 = 
C (1 — loC/2) — (C + l) 2 > in solution II. The constant C is arbitrary in I and II, 
and given by C = (—1 ± \f—2uj — 3) /(w + 2) in III and IV. The constants B, ao, /3q and 
(/>o are arbitrary throughout. 

Now, while solution / is valid for all values of ui, solutions //, III and IV are only 
valid for ui < —3/2. Solution I is also known to be conformally related to the minimally 
coupled massless scalar held solution of Buchdahl |207j . It can be seen that these solutions 
are not all independent of each other. By a transformation of the form r — > 1/r and 
some redefinition of constants, solution II can be transformed into the u < —3/2 range 



of solution / |152j and solution III can be transformed into solution IV [153] . It was 
also shown in |152j that the independent solutions I and IV are both conformally related 
to the general solution of the static, spherically symmetric case in the Einstein frame, as 
found by Wyman [1255] . 

These solutions are very useful for understanding the gravitational fields around an 
isolated body in Brans-Dicke theory, but are not the only spherically symmetric vacuum 
solutions of the Brans-Dicke field equations. A non-static spherically symmetric vacuum 
exact solution is also known |313j : 

ds 2 = -A{r) a{1 ~^ ] dt 2 

+ A(r) a{1+ ^>t^-^ [dr 2 + A(r)r 2 (d6 2 + sin 2 6d(f> 2 )], (108) 

ftrrf^fl-^y***'^-*, (109) 

where we have A(r) = 1 — — , a = P = + 3, and C =constant. This 

solution reduces to a flat vacuum FLRW metric in the limit C — > (an inhomogeneous 



with 



solution requires C ^ 0). The metric (108) is spatially homogeneous at large r and has 
singularities at t = and r = 2C; the coordinates r and t therefore cover the ranges 
< t < oo and 2C < r < oo. This solution is known to be conformally related to [635], 
and shows explicitly the lack of validity of Birkhoff's theorem in Brans-Dicke theory. 
It also reduces to the Schwarzschild solution when u> — > oo. Black hole solutions in 
Brans-Dicke theory with a power-law potential have been investigated in |850j . 

Let us now consider the weak field limit of this theory. Following the PPN prescription 
outlined in previous sections one can straightforwardly find that the relevant values for 
the PPN parameters are: 

Pppn = 1 and Jppn = n + W , (HO) 

2 + oj 

with all other parameters equalling zero. The value of Newton's constant can also be 
shown to be given by 

where (f> Q is the background (unperturbed) value of the scalar field. It can be seen 
that in the general relativistic limit uj — > oo we then recover the usual values of the 
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PPN parameters, and that for finite u the only parameter that deviates from its general 
relativistic value is 7. 

This value of 7 is valid for both the static and non-static exact solutions shown above. 
It is interesting to note, however, that it is not the value of 7 that one should expect 
to measure outside of a black hole that has formed from gravitational collapse in this 
theory. Such an object can be shown to have an external gravitational held with 7 = 1 
1 107 . as predicted by Hawking [595J . This does not, however, mean that gravitational 
collapse to a black hole proceeds in the same way in Brans-Dicke theory as it does in 
General Relativity. In the Brans-Dicke case apparent horizons are allowed to pass outside 
of the event horizon, scalar gravitational waves are emitted during the collapse, and the 
surface area of the event horizon can decrease with time. Such behaviour does not occur 
in General Relativity, and is allowed here because Brans-Dicke theory can violate the 
condition R iiV k a k v > 0, where k a k a = 0. The problem of understanding black hole 
thermodynamics in Brans-Dicke theory has been addressed in 674 . Here it was found 
that the expression for the entropy of a black hole with an horizon S of area A is given 
by 

1 f ,2 rTr\ 1 <M 



Sbh = - <ZW<W= Y< (112) 

such that Sbh is always non-decreasing, even if the area decreases. This shows that 
the second law of black hole thermodynamics can indeed be extended to Brans-Dicke 
theories, with the effective gravitational constant being replaced by l/4>. For an intuitive 
interpretation of this result in the Einstein frame, and for further discussion on this topic, 



the reader is referred to 489^ . 

Having discussed the gravitational fields of point-like objects in Brans-Dicke theory, 
let us now proceed to use observations of weak field phenomena to constrain the theory. 



This can be done most effectively using the constraint on 7 given in Equation ( 71 1 



derived from observations of the time delay of radio signals from the Cassini spacecraft 



as it passed behind the Sun. Together with the expression (110), shown above, this gives 
the 2cr constraint on the coupling parameter 

u > 40 000. (113) 

This is a very restricting constraint on the theory, and shows that deviations of this 
kind from General Relativity must be very small indeed (see the following subsection, 
however, for a discussion of scalar-tensor theories that can evade this bound while still 
exhibiting significantly different behaviour to General Relativity in the early universe). 

Let us now proceed to discuss the cosmology of Brans-Dicke theory. Using the usual 
FLRW line-element, and assuming a perfect fluid matter content, the field equations 
reduce to: 



Stt (p - 3P) 



<j) (2cj + 3) 



-3H~, (115) 



where over-dots denote differentiation with respect to the proper time of a comoving 
observer, H = a/a, and p + 3H(p + P) = 0. The general solutions to Eq. (114) and (115) 
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are now fully understood [5761 K)5] . At early times the vacuum solutions of O'Hanlon and 
Tupper |993| are recovered, while at late-times one approaches the power-law solutions 
of Nariai [535] (when k = Q): 

aft) = ^[i+^-wOl/^+M 1 -^ 2 )] (H6) 

<j>(t) = ot [2(i-3WO]/[4+3<-(i-w a )] j (117) 

wher j^] p — Wp. These solutions can be considered "Machian" in the sense that the 
matter fields are driving the expansion of the Universe, rather than tf>. 

Let us now consider the general FLRW solutions in terms of a transformed time 
coordinate r\ = r)(t). Such solutions can be found any equations of state W |576) . but 
here let us consider only the radiation dominated solutions with W — 1/3. In this case 
the new time coordinate rj is simply the conformal time r given by adr = dt, and the 
general solution for oj > —3/2 is 



a(r) = ai ( T + t+) 2 2 V T +F(r + r_) 2 2 ^ I +F (118) 



0(r)=0 1 (r + r+) V^F( r + r_) V^F (119) 

where t±, a\ and <\>\ are integration constants, and where 8irp r o/ 'Z4>\a\ = 1. For lu < 
—3/2, however, we instead find 



o(r) = ciiJ(t + t-) 2 + t1 exp tan' 1 , (120) 




r) = <M tan- 1 — . (121) 



For to > —3/2 (w < —3/2) we see that the scale factor here undergoes an initial period 
of rapid (slow) expansion and at late times is attracted towards the solution a(r) oc r, 
or, equivalently, a(t) oc . Similarly, <f> can be seen to be changing rapidly at early 
times and slowly at late times. These two different behaviours, at early and late times, 
can be attributed to periods of free scalar-field domination and radiation domination, 
respectively. If p r o — is chosen then these solutions become vacuum ones that are 
driven by the <f> field alone, and for uj < —3/2 the initial singularity can be seen to be 
avoided. Corresponding behaviour can also be shown to exist for other equations of state, 
W. For a more detailed discussion of this phenomenon we refer the reader to [105j . 

Unlike in General Relativity, in the Brans-Dicke theory it is also possible to have 
spatially flat and positively curved exact vacuum solutions. Spatially flat solutions can 



°Note that in this section we use an upper case W to denote the equation of state of the fluid, rather 
than the usual lower case w used in the rest of the review. This is to avoid confusion with the coupling 
parameter u>. 
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be found by assuming <f> oc t x and a oc t v , and by setting o(0) = 0. When n — the 
vacuum Brans-Dicke equations are then solved by 993 

a(t) oc t |(i+2(i-V3(3+2"))- 1 ) ) (i 2 2) 



-2(l-V 3 (3+2"))" 1 



0(f) oc [-1 . (123) 

For spatially closed solutions one can follow the method prescribed in |95j . Here one 
defines a new quantity y = 0a 2 , and uses the conformal time coordinate r, to write the 
field equations as 

(In </>)„-= V3Ay- 1 (2w + 3)- 1/2 and y, 2 = -4«y 2 + J 4 2 , 

where A is a constant. For k > these equations can be integrated to find y = 
(A/2y/n) sin(2 v / K(r — To)), which then gives the solutions 



0(t) oc tanV 7 < 2 "+ 3 > (^(r - r )), (124) 
a(r) oc ^^IZ^)) , (125 ) 

tan V 4(2^+3) _ Tq )) 

Spatially fiat and closed vacuum FLRW solutions such as those shown here do not exist 
in General Relativity, and show the potential for interesting new behaviour at early times 
in scalar-tensor theories of gravity. Phase plane analyses of perfect fluid FLRW solutions 
to the Brans-Dicke field equations have been performed in |731| 110961 1613] . 

A number of anisotropic cosmological solutions of the Brans-Dicke field equations are 
also known. Bianchi type-/ solutions have been found in [10751 IT3"5] . type-// in [8221 
[SUB IMS, type-/// solutions in [525], type-F solutions in [S2U IMS [S27J [S2SJ I5EE] , type- 
VI and VI h solutions in [SJ1 [5JH1 ED] , type-V II h solutions in [5211575], type-V III 
solutions in (8251 1820j , type-/X solutions in (8251 1820] , and Kantowski-Sachs solutions in 
|826] . Inhomogeneous cosmological solutions have also been found |313j , and braneworld 
cosmologies have been considered in [751 11054) . We will not reproduce any of these 
solutions here, but rather refer the reader to the citations above, and references therein. 
For a discussion of the cosmic no-hair theorems in Brans-Dicke theory the reader is 
referred to [582) . where it is shown that these theorems are valid without imposing any 
strong constraints on the coupling constant, cj, so that initially anisotropic universes can 
evolve towards an isotropic final state. 

Now let us consider perturbed FLRW space-times, within which cosmological obser- 
vations are usually interpreted. For the Brans-Dicke theory these equations have been 
studied many times before, starting with [935J. Here we will present these equations in 
the synchronous gauge and with n = 0, as found in |290j . In this case the equations take 
on a simpler form. For the more general case the reader is referred to [934J , or to the 



uj =constant limit of Eqs. ( 145 )-( 149 1 in Section 3.1.3 for the corresponding equations 



in the conformal Newtonian gauge. Now, the perturbed metric can be written as 

9^u = 9nv + a 2 {r)h^ Vl (126) 

where a(r) is the FLRW scale factor, g M „ is the unperturbed FLRW metric with k = 0, 
and h^ v is the perturbation that satisfies = h^ v = in the synchronous gauge. We 
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can then proceed as normal, and decompose the remaining non-zero perturbations 
into harmonic modes, and decouple the scalar, vector and tensor components. The scalar 
part of the perturbations can be written as in Section [2] 



hij ^ hQij 



(127) 



where the synchronous gauge has been adopted,while S(f> is the perturbation to the Brans- 
Dicke scalar. The operator, as in Section 2l is defined by Dij = V^Vj — |feA. As 
usual we define rj = —(h + k 2 v)/6 (see section^). The perturbed equations ar^J 



2/cV 



H 



ti = 



87ra 2 



2 + m 2 



-3H 



2 ^2" 



(128) 



2rf = 



H 



20 



8na- 



8na' 



84> 



and 



2uj + 3 



(130) 



(131) 



Here the perturbations to the energy density and pressure of the non-interacting fluids 
/ are written as 8pj and SPt, with the peculiar velocity potentials and anisotropic 
stress written as Of and S/, respectively. Primes denote differentiation with respect to 
conformal time, r. 

For the tensor modes we can write the metric perturbations as hij = hxQij, where 
Qij is a harmonic function, and with no tensor component involved in 8<f>. The evolution 
equation for hr is then given by 

Ji't + 2Uh' T 



k 2 hi 



8na 2 



(132) 



where S/ is the tensor contribution to the anisotropic stress of the fluid /. We will not 
write the vector perturbation equations here, which are not expected to be significant 
for most cosmological applications. For perturbation equations written in terms of gauge 
invariant variables the reader is referred to |1285j for the covariant approach, or [934] for 
the Bardeen variable approach (for the Brans-Dicke theory one should take uj ^constant 
in this last reference). 

The background cosmological evolution and perturbations can be used to place con- 
straints on Brans-Dicke theory from a number of different sources. The CMB is one 



17 Various typos in the corresponding equations in 12901 have been corrected. 
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such source, and can be used to place constraints on the coupling parameter, uj. This 
has been done a number of times in the literature 935, 7, 1284J, with the latest results 
based on constraints given by the WMAP 5 year data, the ACBAR 2007 data, the CBI 
polarisation data, and the BOOMERanG 2003 flight, together with large-scale structure 
data from the SDSS data release 4, giving w > 97.8 or u < -120.0 to 2a [T284I . This 
is in keeping with the results of [7], but significantly weaker than those claimed by [935] 
of uj > 1000 to 2cr based on the WMAP first year data. Among the detailed processes 
that lead to these constraints one can see that the change in the horizon size at matter- 
radiation equality is altered in Brans-Dicke theory due to the different expansion rates 



|799j . This length scale is imprinted on the spectrum of perturbations as during the 
radiation era perturbations inside the horizon are effectively frozen, while during matter 
domination perturbations grow on all length scales. Different expansion rates also affect 
the horizon size at recombination, which affects the level of 'Silk damping' that occurs on 
small scales due to viscosity and heat conduction. What is more, the thickness of the last 
scattering surface is also changed, which affects anisotropy on small scales through the 
exponential damping which has its cutoff determined by this quantity. The upcoming 
data from the Planck satellite is, of course, expected to tighten the constraints given 
above still further. 

Another cosmological probe that has been extensively applied to Brans-Dicke theory 
is that of the primordial nucleosynthesis of light elements [H9"2l 1591 [3341 |2"61>I |2"6"B1 UJM 
312 . In the Brans-Dicke theory the scalar field <fi is approximately constant during the 
epoch of radiation domination. Nucleosynthesis therefore proceeds largely as in a general 
relativistic cosmology (up to the effect of 'kicks' on the scalar field due to the annihilation 
of electron-positron pairs |358J), but with a different value of G during this process, and 
hence a different expansion rate. Of course, the time at which weak interactions freeze 
out in the early universe is determined by equality between the rate of the relevant 
weak interactions and the Hubble rate. When the weak interaction rate is the greater 
then the ratio of neutrons to protons it tracks its equilibrium value, while if the Hubble 
rate is greater than the weak-interaction rate then the ratio of neutrons to protons is 
effectively 'frozen-in', and /3-decay is the only weak process that still operates with any 
efficiency. This is the case until the onset of deuterium formation, at which time the 
neutrons become bound and /3-decay ceases. Now, the onset of deuterium formation is 
primarily determined by the photon to baryon ratio, rj-y, which inhibits the formation 
of deuterium nuclei until the critical temperature for photodissociation is past. As the 
vast majority of neutrons finally end up in 4 He the primordial abundance of this element 
is influenced most significantly by the number of neutrons at the onset of deuterium 
formation, which is sensitive to the temperature of weak-interaction freeze-out, and 
hence the Hubble rate, and so G, at this time. Conversely, the primordial abundances of 
the other light elements are mostly sensitive to the temperature at deuterium formation, 
and hence rj^, when nuclear reactions occur and the light elements form. The reader is 
referred to 83J for further discussion of these points. The typical bounds that can be 
achieved on the coupling parameter from observations of element abundances are then 
given by cj > 300 or u < —30, assuming the power-law solutions (1161 and (1171. By 
using the general solutions ( 118 1-( 121 ) , however, these bounds can be somewhat relaxed 
or tightened, depending on the behaviour of <fi in the early universe [312] . 

While the cosmological bounds discussed above are weaker than those derived in the 
solar system, and in binary pulsars, they probe a very different physical environment and 
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scale. They are therefore usually considered complimentary to the constraints imposed 
from observations of weak field gravity, and a useful consistency check. After all, one may 
wish to consider theories in which the coupling parameter uj varies throughout cosmic 
history. Theories in which such behaviour can occur explicitly are the subject of Section 
MM 

3.1.3. General scalar-tensor theories 

The Brans-Dicke theory that has so far been considered is a very special scalar-tensor 
theory, with only a single constant parameter. The more general class of scalar-tensor 



theories contains two free functions, given by uj(<f>) and A(<fi) in Eq. (97). Let us now 
consider these more general theories. 

First of all let us consider the case in which A(<p) — 0. Such theories have been 
well studied in the literature, and are often used to model the possibility of having 
a coupling parameter uj in the early universe that is small enough to have interesting 
effects, while being large enough in the late universe to be compatible with the stringent 
bounds imposed upon such couplings by observations of gravitational phenomena in the 
solar system, and other nearby astrophysical systems. This interest is bolstered by the 
presence of an attractor mechanism that ensures General Relativity is recovered as a 
stable asymptote at late times in FLRW cosmology [356J . We will explain this attractor 
in more detail below. 

Of course, in generalising the Brans-Dicke theory we want to know what the con- 
sequences are for constraints imposed in the weak field limit. The extra complication 
caused by allowing uj to be a function of cj> means that exact solutions are hard to find. 
Perturbative analyses can still be readily performed, however, leading to the PPN pa- 
rameters 

duj ld(j) 1 + uj 

/3pPAr = 1+ (4 + 2c)(3 + 2^ and 7PPW= 2W (133) 

with all other parameters equalling zero. The value of 7 here can be seen to be the same as 
in the Brans-Dicke theory, while the value of j3 reduces to the Brans-Dicke (and General 
Relativity) value of unity when uj =constant. Observations from the Cassini satellite 
therefore place upon oj the same tight constraint as in Brans-Dicke theory (to > 40 000 
to 2a). This constraint, however, now only applies to the local value of uj (i.e. with 
the present day value of <f> in the solar system) . The variation of uj with <j> can then be 
constrained by observations of post-Newtonian phenomena that constrain /3, such as the 
lunar laser ranging experiments described in previous sections. To constrain uj for other 
values of <f>, however, requires making observations in other physical environments, such 
as in the early universe, or near black holes. 

Let us now consider the cosmological solutions of these theories. It has been shown 
by Clarkson, Coley and O'Neill in [303J that the Ehlers-Geren-Sachs theorem can be 
extended to cover scalar-tensor theories of gravity. Taking the FLRW line-element, and 
assuming a perfect fluid matter content, the field equations in this case reduce to 



" 2 - t-S-»HS 



„(p-3P) 3H± (135) 



<t> (2uj + 3) 4> (2uj + 3)0 ' 
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where over-dots again denote differentiation with respect to the proper time of comoving 



observers. These equations are similar to those of the Brans-Dicke theory, Eqs. (114) 



and (115), except for the extra term on the RHS of Eq. (|135|) . Exact solutions with 



k = have been found to Eqs. ( |134{ and ( [135] ) in [§1 110041 fTTM IMS] , and vacuum 
and radiation dominated solutions for arbitrary spatial curvature have been found in 
[95] I899| 1111] . Some of the methods used in these papers are extended to anisotropic 
cosmologies in [898] , and the asymptotics of FLRW cosmologies in scalar-tensor theories 
have been studied in [1121 11125] , Exact homogeneous and anisotropic solutions are 
found in |327l I156j that act as past and future attractors for the general solution. Exact 



homogeneous self-similar solutions are found in [132] . and inhomogeneous self-similar 
solutions are found in [157] . We will not reproduce these solutions here, some of which 
can be quite complicated, but will instead return to the attractor mechanism expounded 
in [555] . 

This mechanism is most easily seen in the Einstein conformal frame, given by the 



Lagrangian (105), such that for a spatially flat FLRW geometry the evolution equation 
for the scalar field can be written as 

Q 

^— V>" + 4tt(1 - w)ip' + V4^(l - 3w)a = 0, (136) 

where here primes denote differentiation with respect to the natural log of the Einstein 
frame scale factor, a, and w is the equation of state P — wp. The reader will recall that 
ijj = -y/(3 + 2ui) 1 16-7T In <j> is the scalar field in the Einstein frame, and a~ 2 = 3+2w denotes 
the strength of coupling between the scalar and tensor degrees of freedom. Equation 



( 136 1 is clearly the equation for a simple harmonic oscillator with a dynamical mass, 
a damping force given by — 47r(l — w), and a driving force given by the gradient of a 
potential (1 — 3w)T, where the reader will recall T = v4~7r / adip. This interpretation of 
r as an effective potential is often used to justify an expansion of the form 

T = a (i> -W + yW- V>o) 2 + 0((V» - V-o) 3 ), (137) 

where tpQ is an assumed local minimum of T(ip), and a$ and /3q are constants. In terms 
of this parameterisation the PPN parameters (3ppN and "/ppn then become 

I-Pppn = -^"f^ (138) 



I-Ippn = t^V (139) 
1 + as 



The requirement of positive mass in ( 136 1 can also be seen to be equivalent to the 



requirement of positive energy density, p, in the Einstein frame. 



The cosmological dynamics that result from Eq. (136) are that i/j, and hence 



approach a constant value during the radiation dominated epoch. This is due to the 



vanishing of the 'potential' in (136) when w — 1/3, and the negativity of the effective 
'damping force'. Once radiation domination ends, however, and matter domination be- 
gins, then the scalar field rolls down to the minimum, tpo, of the now non-zero potential 
r(-0) (assuming such a minimum exists). Once this minimum is reached, after some 
possible oscillations in the case of an under-damped system, then we are left with a = 0, 
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which is the general relativistic limit of these theories. This is a very useful general prop- 
erty of any scalar-tensor theory which has a local minimum in its parameter r(^), and 
means that interesting new behaviour is possible at early times, while still being (poten- 
tially) compatible with observations that appear to point towards General Relativity at 
late-time. 

Using order-of-magnitude approximations, the authors of [356J claim that this attrac- 
tor mechanism is powerful enough to drive the value of the PPN parameter 1 — 7 down to 
values of as low as ~ 10~ 7 . This is a couple of order of magnitudes below the level that 
is probed by even the observations of the Cassini spacecraft, but is not inconceivably 
small. In particular, it may be that upcoming observations of binary pulsar systems 
could achieve such levels. Further predictions of this scenario are a possible oscillation 
in the effective value of Newton's constant near the beginning of the matter dominated 
epoch of the Universe's history, as well as a prediction for the locally measured value of 
Pppn given by 



/W-l = ^(l-7), 



(140) 



where /3q is defined in Eq. (137). The validity and limitations of these results are 
extended, and are further studied in |357| I1095J . 

Let us now consider perturbations around a general FLRW background, in these 
generalised theories. We will work in the conformal Newtonian gauge, which has the 
usual correspondence with Bardeen's gauge invariant variables. Tensor perturbations 
on cosmological backgrounds have been studied in [109] . while the scalar part of the 
perturbed line-element takes the form 



ds 2 = a 2 [-(1 + 2*)dr 2 + (1 - 2$)q ij dx i dx j ] 



(141) 



where we have used conformal time, r, and q^j is now the metric of a static 3-space with 
constant curvature. Perturbations to the scalar field and energy momentum tensor are 
given by Sip and 



5T\ 



-Sp 

-{p + P)V l 
SPS 1 , + (p 4 



(142) 
(143) 
(144) 



where p, P and 9 are the total energy density, pressure and peculiar velocity of the matter 
fields. The first-order perturbation equations are then given by 



* + 3 — $' + (fc 2 - 3k)$ 
a 



8tt 



Sp 



1 

a 2 4> 



la 2 



dtu 



3S</> 
a 2 4> 
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a 2 4> 



5<t>' 



(145) 
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+ 2 — * + 2$ 
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a 2 <p 
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— i 

a 



2k 2 



and 



8tt 



( P + P)6 2 



1 

a 2 (f> 



a 2 4> 2 



(147) 



We also have the perturbed scalar field equation 
a 

' ' d 2 w 4>' 2 54> 2 duj 



2 eL S # + fc 2 ^ - 2(/ >"* - ^ (V + 4-vK + 3$') - tea* {6p- 36P) 
a \ a ) (2u) + 3) 



a 

~ (2w + 3) d(/> 2 
as well as the condition 



+ 



a 2 d<fi 



2 duj 
a 2 dtp 



+ 2-<// )5<f> 
a 



$-*= 8 ^(p + P)S 



(149) 



This last equation shows that, unlike in General Relativity, $ ^ ^ when anisotropic 
stresses vanishes (unless the perturbations to the scalar field also vanish). Primes here 
denote differentiation with respect to the conformal time, r, and k is the wave-number 
of the perturbation. 

Using the equations given above with n = 0, an analysis of the first year WMAP data 
has been performed and used to constrain the parameters ao and /?o of the attractor 
model in [935] . The authors of this study find that the following constraint can be 
imposed on these parameters at the 2a level of significance: 

-700 



a <5x 10" 



(150) 



One should bear in mind here that as /3q — > Brans-Dicke theory is recovered, and 
as ao — > General Relativity is recovered. As a corollary of this result these authors 
also constrain the value of Newton's constant at recombination to be no more than 5% 
different from the value measured in the solar system today, at the 2a confidence level. 
The effect of allowing a non-zero spatial curvature should be expected to weaken these 
bounds. 

Big bang nucleosynthesis has also been explored in the context of general scalar-tensor 
theories [1210, 358, 3191 1773j . In |358j it is found that the inferred upper bound on the 
baryon density in the Universe is relatively insensitive to the presence of a gravitational 
scalar field, and that the parameters of the attractor model must satisfy the constraint 



"o < lO- 6 - 5 ^" 1 



0.15 
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-1.5 



(151) 



when /3q > 0.5. For /? < 0.5 these bounds are weakened by a few orders of magni- 
tude. These results are extended and rehned in |319j . who also allow for a non-zero 
self-interaction potential for the scalar field. The apparent tension between observed and 
theoretically predicted abundances of Lithium-7 is addressed in the context of scalar- 
tensor theories in [773]. Here the authors point out that a period of expansion slower 
than in General Relativity before primordial nucleosynthesis, together with a period of 
more rapid expansion during nucleosynthesis, can resolve this conflict. They find such be- 
haviour in numerous scalar-tensor gravity theories, both with and without self-interaction 
potentials. 

Inflation in scalar-tensor theories of gravity has been extensively studied, often under 
the name 'extended inflation', as coined by La and Steinhardt for the case of Brans-Dicke 
theory [762] . The motivation behind this is the possibility of producing a successful 
inflationary phase transition from a false vacuum state, thus avoiding the fine tuning 
problems associated with 'new inflation'. Unfortunately, it was soon found that bubble 
collisions at the end of inflation produce unacceptable fluctuations in the CMB [126~4"1 I75T1 
804 . Suggestions to improve this situation were to include a self-interaction potential 
for the scalar field [763] . generalise the couplings of the Brans-Dicke scalar to other 
fields |617j (see also [614U1260] '). include quantum effects |616j . add additional couplings 
between the inflaton and the space-time curvature [775], or to consider more general 
scalar-tensor theories of gravity 1185] [521] • The latter of these approaches was dubbed 
'hyper-extended inflation'. The inflationary solutions of general scalar-tensor theories 
have been studied in detail in [106) 1212) [86] , and specific models that could be compatible 
with observations were proposed in [554J . Density perturbations in inflationary scalar- 
tensor scenarios have been investigated extensively in [5511 15781 ITT2T1 15371 10051 [7551 15101 
HT51 15151 HTTl [7501 [TT5T1 ITT351 I5B51H2B2] . Studies of topological defects [5531 [1555] . black 
holes [625] , gravitational waves [12291 1145) , baryogenesis (55J 1100) , baryon asymmetry 
[1295] . dark matter [874)1875] . the formation of voids [802 )1803) . bubble nucleation rates 
and dynamics [1255111055] , reheating [5JSJ, stochastic inflation [5^I5231[52^[TT0T1[TT02] . 
slow roll inflation [p_5J 15271 11212] , non-Gaussianity [1047] , isotropisation of the Universe 
[5831 1584) . and quantum cosmology [533 have all also been performed in the context 
of inflation in scalar-tensor theories. The initial conditions for inflation in scalar-tensor 
theories have been considered in 407, 441, 406 . For further details the reader is referred 
to 1993 review of extended inflation by Steinhardt [1184) . 

Theories of gravity with non-minimally coupled scalar fields and non-zero self- interaction 
potentials have been studied by a number of authors under the name 'extended quintessence' 
[10351 [741 1551 110611 11109] . Such theories can act as dark energy as well as model possible 
deviations from General Relativity at early times. These papers include studies of small 
angle CMB temperature and polarisation power spectra, the integrated Sachs- Wolfe ef- 
fect, the matter power spectrum, supernovae observations and the affects that should be 
expected on weak lensing observations. The FLRW solutions of theories with power-law 
self-interaction potentials have been studied in further detail in [252) , where the attractor 
mechanism to general relativity is investigated, as well the presence of periods of accel- 
erating expansion at late and early times. Late-time acceleration in models without a 
potential for the scalar field is studied in [483] . 

Another interesting possibility in scalar-tensor theories of gravity is the idea of 'grav- 
itational memory', proposed by Barrow in [93] . The idea here is that when a black 
hole forms one of two things can happen (or some combination of them). Firstly, the 
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Schwarzschild radius of a black hole, which is given by rg = 2G(t)m, could vary as the 
value of the scalar field controlling the value of G varies in the background universe. In 
this case there is no such thing as a static black hole solution to the gravitational field 
equations, unless the black hole exists in a static universe. Secondly, the Schwarzschild 
radius of a black hole could be frozen in at its value when the black hole formed, so 
that r$ = 2G(tf)m, where i/ is the time when the black hole formed. In this case 
black holes that formed early on in the Universe's history would remember, in some 
sense, the conditions of the early universe, this being reflected in the value of G(tf). As 
Barrow points out, these two possibilities have consequences for the evaporation, and 
explosion, of black holes in the late universe. This idea has motivated a number of 
studies on the gravitational field of collapsed objects in scalar-tensor theories of gravity 
[571 IT2TTI IT2T51 IrJMl 15551 155DI IM81 IM7] . One particularly interesting approach is that 
of matched asymptotic expansions, which suggests that the first option is followed, and 
black holes do not have any gravitational memory [11351 111361 11134j . 

Horndeski's theory 

The most general four dimensional scalar-tensor theory with second-order field equa- 
tions was worked out by Horndeski in [623] . It has the following Lagrangian 



(F + 2W)RJ"' - 4f x VV^V v V^ + 2k 8 V q 0V'WV / 9< 



3[2(F + 2W),4, + Xk s }V^V^ + K 9 (cf>, X), (152) 



where X — V M </>VV, and ^„ 2 2 ;.'.u^ = n\5^l5Y^...S^. The theory depends on four arbi- 
trary functions of <j) and X, = Ki((p,X) as well as F = F(cj>,X), which is constrained 
so that Fx = Ki,<b — k 3 — 2Xk 3 x- Note that W = W(4>), which means that it can 
be absorbed into a redefinition of F{<f>,X). This paper is not very well known, and as 
a result Horndeski's theory has not been well explored. It has, however, been recently 
resurrected in [276], where aspects of the theory on FLRW backgrounds were studied. 
The effective Lagrangian describing the cosmology in the minisuperspace approximation 
is given by 

3 

Lf(a, a, 0, 0) = a 3 £ (a„ - B n ~) H n , (153) 

n=0 

where H = a/a is the Hubble parameter, and where we have 

A = -Q 7 ,*j> + Ko (154) 

B = Qi,*j> + 12k 3 2 - 12F (155) 

A 1 = -l2F t4 ,j> + 3(Q 7 4 - Q 7 ) + 6^ 3 (156) 

B 1 = -Qxj> + Qi (157) 

A 2 = -12F - 12F^ 2 (158) 

A 3 = 8/si,^ 3 , (159) 
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where 



n 10 a 

Qi = n . = — 12ki, and 



(160) 



d(f> d<j) 
It is assumed that matter is minimally coupled to the metric g^, and not to the scalar. 
Indeed, it is argued that if the equivalence principle is to hold then this can be as- 
sumed without further loss of generality. The cosmological field equations are presented 
implicitly as a generalised Friedmann equation: 



dLf 
da 



nroff 
\ OL H 

d<j> 



reff 



(161) 



and the scalar equation of motion 



dLf 

d<t> 



d_ 
dt 



8Lf 



d(f> 



= 0, 



(162) 



where p is the energy density of the cosmological fluid. 

In |276j the authors look for those corners of Horndeski's theory that admit a self- 
tuning mechanism. They demand that the vacuum space-time is Minkowski, irrespective 
of the value of the cosmological constant, and that this should remain true even after 
a phase transition in which the cosmological constant changes by some amount. This 
is not in violation of Weinberg's theorem since Poincare invariance is explicitly broken 
by the scalar. These considerations reduce Horndeski's theory to four base Lagrangians 
known as the Fab Four. 



•^georgc y/ y^george R 

Gringo = \Z~5K-ingo(0)G, 



(163) 
(164) 
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where G = Rn Ua f)R^ va ^ ~ ^Ru,vR^ v + R 2 is the Gauss-Bonnet combination, and 



r afj — 



_ Rfl u 



a/3 



is the double dual of the Riemann tensor. These terms give rise to self-tuning cosmologies 
for k < 0. The relevant cosmological field equations are given by 



H john + "Hpaul + ft 



gcorgc 



ftVingo — — [PA + Pmatter] 



(167) 



where we have separated the net cosmological constant contribution, p^, an d the matter 
contribution, p ma tten and where 
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Hp au i = -WpiM^H [5H Z + 3 



^gcorgc 6V^ eor g e ((/>) 



65 



H 



H(j> 



V 

gcorgc 
^er cor ere 



The scalar equations of motion are £j hn + £paui + £gcorge + £ringo = where 



fjohn = Gj t [a 3 ^ ohn (0)0A 2 ] - 3a 3 ^ ohn ((^ 2 A 2 

fpaui = -9^ [a¥ paul (^)^aA 2 ] +3a 3 V; aul {0)<P 3 HA 2 

fgcorge - -6^ [a 3 F g ' corgc (0)A 1 ] +6a 3 ^ orgc (0)0Ai 



+6a 3 F g ' COI . gc (0)A 2 

Gringo = -ZW^Mjt « 3 (^A x + ^A 3 ) 




and we define A n = H n — y^-^ J • We see that the self-tuning is achieved at the level 

of the scalar equation of motion, since on a Minkowski solution one has H 2 = — -4- =>• 
A„ = for n > 1. In vacuum, the cosmological constant controls the value of the scalar 
via the generalised Friedmann equation. A detailed study of the phenomenology of the 
fab four has yet to be carried out, but the authors of |276j argue that the 'john' and 
'pauP terms are expected to play a crucial role, as their derivative interactions could give 
rise to Vainshtcin effects that could help pass solar system constraints. The Vainshtein 
mechanism is discussed in detail in Section f5. 5. 41 

Note that it has been shown that Horndeski's general theory is equivalent to |402) in 
four dimensions |710j . Aspects of cosmological perturbations are studied in |710| that 
may be applied to the Fab Four in the appropriate special case. 

3.1.4- The chameleon mechanism 

The 'chameleon mechanism' was introduced as a concept in gravitational physics by 
Khoury and Weltman in 689 , 688 . The basic concept here is that if we consider theories 
with a non-minimally coupled scalar field, then in the presence of other matter fields 
these scalars can acquire an effective mass parameter that is environmentally dependent. 
One can then potentially satisfy the tight constraints on non-minimally coupled scalar 
degrees of freedom that are imposed in relatively dense environments, such as exist in 
the solar system, while still having interesting new behaviour in less dense environments, 
such as those that can exist in cosmology. 

This mechanism is usually formulated in the Einstein conformal frame, where the 
coupling between the scalar curvature and scalar field is minimal, but where the scalar 
field couples non-minimally to matter fields. The relevant action is then 



are a set of constants denoting the coupling of ip to each of the i matter fields 'J';. In 
the scalar-tensor theories so far discussed the scalar field should be considered coupled 
to each of the matter fields with the same universal coupling, which in the Brans-Dicke 
theory is given by (3~ 2 = 2(3 + 2w). Assuming such a coupling, the non-relativistic limit 
of the scalar field equation can then be written as 




(168) 



where, in the notation used in Eq. (105), we have taken T 



V&Trfiiip, and where the 
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where Ves(tp) = V(tp) + pe^*^ . This new 'effective potential' can be seen to be depen- 
dent on the ambient energy density, and if tp and /3 are both positive then any runaway 
potential with dV/dtp < will result in an effective potential with a local minimum whose 



position depends on p. What is more, for couplings of the type specified in Eq. ( 168 ) 
the local effective mass of the scalar field tp, given by = d 2 V c s/dtp 2 , can be seen to 
generically increase with increasing p. Hence, the name 'chameleon'. 

The behaviour of scalar fields outside of massive objects, when the chameleon mech- 
anism is present, can be shown to be crucially dependent on the ratio of Atp to <J> C . 
Here Atp denotes the difference in the value of the scalar field inside the object, tp c , and 
asymptotically, ipoo, while $ c is the value of the Newtonian potential at the surface of 
the object, where r = R c . More precisely, when one satisfies the condition 



Woo - 4> C ) <<; 1 (17Q) 



6/3$ c 

then the resulting configuration of gravitational fields is found to be one in which tp 
occupies the minimal of the effective potential inside the bulk of the massive object, 
except for a thin region of depth AR C just below its surface where the value of tp rises. 
Outside of the object tp increases further, and approaches its asymptotic value V'oo as 

2/3 f3AR c \ M c e- m ~l r - R J 
ip^ipoc 7= —5— , (171) 



S8tt V R c 

where is the effective mass of the field at asymptotically large distances from the 
object of mass M c . Now, the ratio of the thickness of the shell just below the object's 
surface to the object's overall radius, AR C /R C , can be shown to be well approximated by 
the LHS of Eq. (1701. The condition given in ( |170 ) is then equivalent to the condition 



that a 'thin shell' should be present, with AR C /R C -C 1. 

If the 'thin shell' condition is not met then one instead has tp ~ ipoo everywhere, and 
the exterior solution is given by 

2/3 M e e- m ~ 

V-Voo 7= ■ (172) 

V8n r 



A comparison of Eq. (172) with Eq. (171) immediately shows that without a thin shell 
variations in tp are no longer suppressed by the small factor of 3Ai? c /i? c , and that we 
should therefore expect in this case more obvious consequences to the existence of tp 
within the vicinity of massive objects. Khoury and Weltman proceed to argue that in 
order to avoid violations of the weak equivalence principle, and unacceptable deviations 
from the predictions of General Relativity in the solar system, we should require that 
the Earth, and other astrophysical bodies, should satisfy the thin shell condition [688 . 

This idea of a scalar field with an environmentally dependent mass has sparked 
widespread interest since it was proposed. In particular, it allows for the possibility of 
measuring fifth forces, or violations of the weak equivalence principle, that are different in 
space than they are on Earth, (BjU EHU EUl GUI H2~5B1 EM MS[ fTTMl HTM1 IT^gl . 
It can act as dark energy [1911 1197j , and has been studied in the context of structure 
formation [196, 188J, as well as a number of other cosmological scenarios |912l 11901 19131 
11911 484, 239, 362 . The effect of 'chameleon particles' on searches for axion-like particles 
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and experiments involving magnetic fields have been studied in [192, 14, 537 , and their 
effect on the propagation of light in astrophysics in [219. 220J. Experimental searches for 
chameleons have now been performed by GammeV [2551 ITTEfl IT23"gI ITT8"3"] , and ADMX 
[1079] . which have started to constrain the viable parameter space of these theories. 
Other tests of this scenario are also proposed in [1821 ESS [7881 fTT08l H99l fT86] . 

3.2. Einstein- JEther Theories 

Vector-tensor theories, in the form of Einstein-aether theories, have had a revival over 
the past decade, and are now often used as a counterfoil to test General Relativity. They 
have the particular property that they single out a preferred reference frame and have 
become somewhat of a theoretical workhorse for studying violations of Lorentz symmetry 
in gravitation. In the Einstein-aether theory [655j violations of Lorentz invariance arise 
within the framework of a diffcomorphism-invariant theory, and their modern incarna- 
tions are a refinement of the gravitationally coupled vector field theories first proposed 
by Will and Nordtvedt in the 1970s [12761 1989j . The presence of a Lorentz-violating 
vector field, henceforth called the (Ether, can dramatically affect cosmology: It can lead 
to a renormalisation of the Newton constant [25 7j . leave an imprint on perturbations in 
the early universe (8091 1677j , and in more elaborate actions it can even affect the growth 
rate of structure in the Universe |1311[ 113141 1589j . 

3.2.1. Modified Newtonian dynamics 

Some of the theories that we will discuss in this subsection and the next have been 
constructed to give modifications to Newtonian gravity on galactic scales. To be more 
specific, they should lead to Milgrom's Modified Newtonian Dynamics (895] . also known 
as MOND, in regimes of low acceleration. Given its relevance for Einstein-aether theories, 
we will now briefly describe the motivation for MOND, and how it works. We also briefly 
mention some of its successes and failures. 

MOND was first proposed as a possible explanation of the need for dark matter in 
galaxies, based on observations of their rotational velocities. With Newtonian gravity 
and the visible baryonic matter in galaxies only one expects that the rotational velocity, 
v r , should depend on the distance from the centre of the galaxy, r, as v r oc r -1 / 2 . What 
is in fact found in observations of spiral galaxies is that v r is approximately constant 
at large radii. The conventional answer to this problem is to posit that galaxies sit 
in halos of dark matter, with energy density profiles that vary as p ~ r~ 2 for large r. 
Milgrom's proposal was that, alternatively, Newton's inverse square law of gravity could 
be modified in the low-acceleration regime of galactic dynamics. Such a modification, it 
was ventured, may be able to account for the anomalously high rotational velocities in 
spiral galaxies without invoking any new matter fields. 

In MOND the spherically symmetric gravitational potential has two regimes: High 
acceleration and low acceleration. In regions of high acceleration (where \a\ ^> do, for 
constant ao), it simply satisfies Newton's second law: a = — V$ where <& is the gravi- 
tational potential. On the other hand, in the low acceleration regime (where \a\ <C cio)> 
Newton's second law is modified to (\a/ao)a — — V<I>. Albeit a simple rule of thumb, 
Milgrom's proposal is remarkably successful at fitting a large range of spiral galaxy ob- 
servations. Furthermore, it can be used to explain the Tully-Fisher relation that relates 
the velocity of rotation of a spiral galaxy with its intrinsic luminosity. Unfortunately 
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MOND is unable to explain the dynamics of clusters of galaxies without recourse to 
additional dark matter (possibly in the form of neutrinos), and the behaviour of dwarf 
spheroidals in different environments is also problematic. Nevertheless, it is an interesting 
proposal that has had a renewed surge of interest in the past decade. 
The non-relativistic Poisson equation in MOND can be written as 

= 4ttG P , (173) 

where p is the energy density in baryons, and the function fi(x) — ¥ 1 as x — > oo and 
p(x) — ¥ x as x — >■ 0. There are a variety of proposals for the precise form of (a(x) that 
fit observations of galaxies to a greater or lesser degree. As a theory of modified gravity, 
however, MOND's greatest limitation is that it is restricted to non-relativistic regimes. 
It therefore cannot be used to make prediction on cosmological scales, nor can it be used 
to calculate fundamentally relativistic observables, such as lensing. Many of the theories 
that follow in this section have been constructed to address this deficiency: They are 
relativistic gravitational theories that have MOND as a non-relativistic limit. 

3.2.2. Action and field equations 

As the name suggests, vector-tensor theories involve the introduction of a space-time 
4- vector field, A 1 *, A general action for such theories is given by 



S = / oTx 



' R + C{g^,A v ) 



l6nG 



+ S M (g' MV ^), (174) 



where S m is the matter action. Note that the matter fields \fr in Sm couple only to the 
metric g^ v , and not to A v . 

Let us now focus on Einstein-aether theories, and hence forth consider A^ to have a 
time-like direction. The simplest (and most thoroughly studied) version of the Einstein- 
aether theory is quadratic in derivatives of A v , and has the form 

C EA {g^,A v )=^^[K^ a ^^ v A^+\{A v A v + l)], (175) 

where K}*" a p = cig^ v g a p + C2^ a b v ^ + ^S 1 ' a - c i A tl A v g a p and A is a Lagrange 
multiplier. In what follows we will use the notation c±2... = cx + C2 + . . . . We call the 



theory derived from Eqs. (174) and (175) the linear Einstein-aether theory. 



A more general, non-linear Lagrangian for the aether field can be written in the form 
Cc*A<!r,A») = ^F { K) + ^X(A^ + 1), (176) 

where K — K^ u a/3 V \iA a V ' V A^ , and M has the dimension of mass. We shall call this a 
generalised Einstein- aether theory. 

Such actions arise from Lorentz violating physics in quantum gravity. Indeed, the 
linear Einstein-aether theory can be constructed using the rules of effective field theory, 
and has been shown to be stable with regard to quantum effects [1273]. Such theories, 
however, can suffer from instabilities at the classical level, with the onset of caustics in a 
finite time [33 lj . This raises the question of whether the vector field in such theories are 
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merely an effective (possibly composite) degrees of freedom, or whether they are genuine 
fundamental fields. 

The gravitational field equations for this theory, obtained by varying the action for 
the Generalised Einstein-aether theory with respect to <7 M " are given by 
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= Tn V + 8ttGT. 
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where F K = |f and J p a = (K^ 
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K ufl p a )V ' v Ap . Brackets around indices denote sym- 
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metrisation, and is defined by the functional derivative Y^ v — V a A p V pA — — . 
The equations of motion for the vector field, obtained by varying with respect to A v , are 



V^F K J^)+F K y v 



where we have defined y v — V a A p V pA° — 
respect to A fix A v A v = — 1. 



= 2XA V , (179) 
Finally, variations of the action with 



3.2.3. FLRW solutions 

In a homogeneous and isotropic universe with perfect fluid matter content, the vector 
field will only have a non- vanishing T component, so that A p = (1, 0, 0, 0). The equations 
of motion then simplify dramatically, so that V^A^ — 3H and K — 3^^-, where a = 
c i + 3 C 2 + C3. Note that the a we have defined here has the same sign as K. The field 
equations then reduce to 



1-aK 1 ' 2 - 
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If we now take F(x) — r fX n , the modified Friedmann equations become 



2(n-l)* 



8nG 



where e = (1 — 2n)j(— 3a) n /6. We also get the relationship 



6(0» - 1) 
(1 - 2n)(-3a)" 
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(180) 
(181) 

(182) 
(183) 



where Cl m = SnGpo/SH 2 , and Hq is the Hubble constant today. Let us now consider a 
few special cases: If n — 1/2, the Friedmann equations are unchanged (e = 0) and there is 
no effect on the background cosmology; with n — 1 we have that e = ja/2 and Newton's 
constant is rescaled by a factor of 1/(1 + e) 257 ; if n = we recover a cosmological 
constant, A ~ sign(—^)M 2 . More generally, we will obtain different regimes depending 
on the relative size of each term in the modified Friedmann equation. We can summarise 



these behaviours in Figure 3.2.3 
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Figure 1: A schematic representation of the late-time evolution of FLRW solutions as a function of n 
and 7, for n < 1. 



3.2-4- Cosmological perturbations 

The four-vector can be perturbed as A^ = (1 — \I>, -V*V), where V is a small 

tt2 

quantity. Perturbing K to linear order then gives K = K + K\, where K = i^p- and 
K x = -2^{k 2 \ + 3iJ* + 3$). The gravitational potentials * and $ come from the 
perturbed metric: 

ds 2 = a 2 (r) [-(1 + 2*)dr 2 + (1 - 2<f>)q ij dx i dx j ] , (184) 

were qij is the unperturbed conformal metric of the hyper-surfaces of constant r. 
The evolution equation for the perturbations in the vector field are 

= a[V" + k 2 V + 2HV' + 2H 2 V + + + 2%^] (185) 

a" 

+c 2 [k 2 V + m 2 V - 3 — V + 3$' + 3ft#] 
a 

a" 

+c 3 [k 2 V + 2H 2 V - —V + $' + Wtf] 
+ *^[-K ia H - K' a (- Cl (V' + tf) + 3c 2 W + caWV)]. 
The perturbation in the vector field is sourced by the two gravitational potentials $ and 
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ty. The first-order perturbations to the vector field's stress-energy tensor are 

a 2 ST% = F KCl [-Hk 2 V - k 2 V - fc 2 *] 

+F K a[Hk 2 V + 3H$' + - ZF KK (i}i 2 K x 



a 2 5T\ 



(186) 



F KCl [-Uk 2 V -k 2 V - kH] + F K a(2n - l)[Uk 2 V + 3H$' + 3H 2 *] , 
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-<xF^$' - 2aK'^m + a\n(F KK )'K 1 H - c 2 K' k 2 V}6 1 p 

where the second expression for a 2 6T° assumes the monomial form for F(K). In the 
absence of anisotropic stresses in the matter fields, we may obtain an algebraic relation 
between the metric potentials $ and <J> by computing the transverse, traceless part of 
the perturbed Einstein equations. This gives 



fc 2 (*-4>) = 



= ( Cl +c 3 )k 2 [F K (2HV + V')+F KK K' V}. 
We then find the following expression for the perturbed field equations: 

1 



fc 2 $ = --F KCl k 2 [V + * + (3 + 25 3 )W] 
-AirGa 2 ]T (p a S a + 3(p„ + P a )U0 a ) 
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Before we look at the cosmological consequences of these theories, and constraints that 
can be imposed on them, it is instructive to study the effect of the vector field during 
matter domination. This should allow us some insight into how the growth of structure 
proceeds in the generalised Einstein-aether case. First let us consider the simplest case 
in which the dominant contribution to the energy density is baryonic, so that we can 
treat it as a pressureless perfect fluid. Let us also introduce the new variable V = E. 
For illustrative ease we will initially consider only the case where V is described by 

a growing monomial, such that V = Vq (^^j i During the matter dominated era we 



then have a 2 ST° n 
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(191) 



where k^ub = ^-/^today Hence, the vector field affects the evolution equations for the 
matter and metric perturbations only through its contribution to the energy density 
and anisotropic stress. On large scales, fcr <C 1, and assuming adiabatic initial con- 
ditions for the fields 8,®, 9, this leads to 8 = d(k) + p^^j r 5+p - 6 ", where C x is 
a constant of integration and we have omitted the decaying mode. Therefore, even 
before horizon crossing, the anisotropic stress term due to the vector field can influ- 
ence time evolution of the baryon density contrast. On small scales, kr ^> 1, we find 

5{k,r) = C 2 (fc)r 2 + (5+p j^ ( to+ ) p „ 6 „ ) e(fc)(fcr) 2 r 5 +P- 6 ", where C 2 {k) is another constant 
of integration. Hence, for sub- horizon modes, the influence of the vector field on the evo- 
lution of 5 is a combination of its affect on the energy density and anisotropic stress 
contributions, though both, in this limit, result in the same contributions to the scale 
dependence and time evolution of the density contrast. 

3.2.5. Observations and constraints 

Let us now consider the constraints that can be imposed on these theories. First 
of all we will consider the linear Einstein-aether theory, and then we will consider the 
generalised Einstein-aether models. 

In the case of the linear Einstein-aether theory, a number of non-cosmological con- 
straints on the Cj have been derived: Most notably, a Parameterised Post-Newtonian 
(PPN) analysis of the theory leads to a reduction in the dimensionality of parameter 
space. This is occurs due to the requirement that c 2 and C4 must be expressed in terms 
of the other two parameters in the theory as = (— 2c 2 — CiC3+c|)/3ci and C4 = —c\jc\. 
Additionally, the squared speeds of the gravitational and aether waves with respect to 
the preferred frame must be greater than unity, so as to prevent the generation of vac- 
uum Cerenkov radiation by cosmic rays. A final constraint arises from considering the 
effects of the aether on the damping rate of binary pulsars. The rate of energy loss in 
such systems by gravitational radiation agrees with the prediction of General Relativity 
to one part in 10 3 . In the case of the Einstein-aether theory it has been shown that to 
agree with General Relativity in these systems we must require that c + = c\ + C3 and 
c_ = c\ — C3 are related by an algebraic constraint. A more exotic, but viable, subset 
of the parameter space can be considered if we set c\ ~ C3 = 0. The PPN and pulsar 
constraints are then no longer applicable, and a cosmological analysis is potentially the 
only way of constraining the values of the coupling constants. 

Using a combination of CMB and large-scale structure data |1315] it is possible to 
impose constraints on the coefficients of the theory, Cj, as well as the overall energy 
density in the aether field. The main effect on the evolution of perturbations is through 
the change in the background evolution, and not necessarily through the presence of 
perturbations in the vector field. Indeed, artificially switching off the perturbations in 
the aether field has essentially no effect on the power spectrum of large-scale structure, 
and a small effect (of approximately 10%) on the angular power spectrum of the CMB. 
In Figure [2] we plot the join constraints on c+ and c_ that can be imposed from these 
observables. 

If we consider the generalised Einstein-aether theory, we find that the effect on the 
CMB is much more pronounced. Let us first consider a Universe with no dark matter, 
and in which the perturbations in the aether field simultaneously mimic a perturbed pres- 
sureless fluid in the formation of large scale structure, whilst behaving entirely differently 
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Figure 2: Likelihood plot in the parameter space of — c+ and — c_ from observations of the CMB and 
large-scale structure. The black lines are the 1 and 2a contours, for which we have marginalised over the 
values of the other parameters. The hatched region is excluded by Cerenkov constraints. The dashed 
line indicates the constraints available from binary pulsars. 



in the cosmological background 1314 . The hrst requirement for successful perturbation 
evolution is that structure can form at all. One necessary condition for this is that the 
sound speed of the structure seed is not too large, since this would wash out structure. 
It is therefore required that the sound horizon in the models we are considering should 
be less than the smallest scales where linear structure can form: Csfc max r < 1, where 
J; max ~ 0.2/i/Mpc. For matter power observations at r ~ 3 x 10 4 , which is the present 
epoch, this yields Cs < 10 -4 . 

There are two underlying physical processes that can constrain these models. The 
first is a change in the rate of growth of the amplitude of perturbations. This can cause 
discrepancies between the amplitudes we expect in the matter power spectrum and the 
CMB, since the evolution between the two is different. It can also lead to an integrated 
Sachs- Wolfe effect during the matter era, as $ becomes time dependent. The second 
process is due to the increased magnitude $ — \I>. This also leads directly to a non- 
negligible integrated Sachs- Wolfe effect in the matter dominated era. The details of each 
of these processes depends on the functional form of F, the time-dependence of the £ 
growing mode, and the choice of the parameters Cj. It is extremely challenging to find 
combinations of the parameters that allows for a realistic growth of structure, while 
simultaneously ensuring the integrated Sachs- Wolfe effect is acceptably small. 

A consequence of these two effects is that it is impossible to find models where the 
aether field replaces the dark matter that fit the available cosmological data. This is not 
due to the matter power spectra, which can be reasonably fitted to the SDSS data, but 
from the CMB. In the \ow-£ regime a large ISW effect is clearly present, destroying any 
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chance of fitting the CMB data at large scales. The positions of the peaks are also poorly 
fit by the model. Finally, to fit the matter power spectrum to the data requires rescaled 
by a factor 0.02, which corresponds to a galaxy bias of 0.14. Such a scaling is considered 
to be improbably small, on physical grounds. All these effects cause severe problems 
when attempting to simultaneously fit the CMB and large-scale structure. 

Finally, let us consider the possibility of late-time accelerating expansion. A detailed 
comparison with the data seems to allow a range of values for the index n, and the three 
coupling terms of the theory, which can produce this behaviour. In the limit n ac — > 0, 
however, the aether field behaves exactly as a cosmological constant term. 

3.3. Bimetric Theories 

In this section we will consider theories that involve two rank-2 tensors. These are 
often referred to as "bimetric" , or "tensor-tensor" , theories of gravity. The first formu- 
lation of a bimetric theory appears to be due to Rosen |1067| 11068] , and involves the 
addition of an extra non-dynamical rank-2 tensor into the theory. Rosen's theory, how- 
ever, is now known to lead to the existence of states that are unbounded from below in 
their energy. As a result Rosen's theory predicts the spin up of pulsars, as gravitational 
waves with negative energy are emitted. This severely violates the constraints on these 
systems that have been imposed by observations of millisecond pulsars |779j . 

Following in Rosen's footsteps, there were a number of proposals over the years of 
how one could formulate a viable bimetric theory of gravity. Here we highlight what we 
consider to be some of the most interesting cases. These include Drummond's bimetric 
(or "bi-vierbein" ) theory, which is claimed to mimic the dark matter in spiral galaxies 
[434], as well as arguments by Magueijo that bimetric theories could exhibit a variable 
speed of light, thus providing a way to model time-varying fundamental constants. More 
recently, Banados and collaborators have shown that a general form of bigravity, which 
includes specific forms previously proposed in 647, 355 might allow one to account for 
some aspects of the dark sector (S3 [TUj. Finally, Milgrom has recently proposed a 
bimetric theory that reduces to MOND in the appropriate limits. In what we follows, 
we will briefly outline each of these theories. 

The basic idea behind bimetric, or tensor-tensor, theories is the introduction of a 
second 'metric' tensor into the theorjj^J a dynamical metric, g^ u , and a second metric, 
g a /3. The first of these is usually universally coupled to the matter fields, and is used to 
construct the energy-momentum tensor of the non-gravitational fields. It is this field is 
used to define the geodesic equations of test particles. The equations that govern g^, 
however, are not the Einstein field equations: They invariably involve g a p as well. 

If the g a p is not dynamical, then it is usually taken to be highly symmetric (i.e. 
exhibiting the maximal 10 Killing vectors X, such that Cxg a /3 — 0). An obvious choice 
for g a /3 is the Minkowski metric, 7^ t „, so that all components of the Riemann tensor 
constructed from g a p vanish. Rosen's bimetric theory is a particular example of such 
a construction, as are some attempts to construct a massive theory of gravity. If g a p 
is to be dynamical, then a kinetic term of the Einstein-Hilbert form is required in the 



18 A second rank-2 tensor would probably be a more accurate description of what is actually being 
added here, as the term 'metric' implies a particular geometric function. Nevertheless, the term 'metric' 
for this additional field is commonly used, and so we follow this convention here. 
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gravitational action. Coupling terms are then also required between g a p and <jv„, with 
the matter fields usually coupling to either one, or a combination of both, metrics. 

3.3.1. Rosen's theory, and non- dynamical metrics 

As advertised, Rosen's bimetric theory is constructed with an extra flat metric, g a p — 
ria/3, such thalp^l 

WupaGap) = 0. (192) 

We can now define a covariant derivative in terms of g a p, which we will call V^, such 
that the field equations for the dynamical metric can be written 

\~9 aP V Q Vpg^ - l -g afi 9 5 ^ a gs^09e V = -8ttG^3(T mv - \g^g ap T aP ) (193) 
i z y— g i 

The energy-momentum tensor satisfies the conventional conservation equation V^T„ y = 
0. 

Rosen's theory has been the subject of a number studies over the years. It has been 
found to be extremely successful when subjected to a PPN analysis, and compared to 
Solar System observations [779]. In fact, almost all the PPN parameters in Rosen's 
theory are indistinguishable from those of General Relativity. The only exception is 
«2 = v g/c 2 ~ lj where v g is the speed of gravitational waves in Rosen's theorem, and c is 
the speed of light. One should note here, however, that v g is not uniquely determined by 
the theory, but rather by the cosmological solutions to the theory. One can then adjust 
the initial conditions of the Universe in order to tune cti . If this is done then the theory is 
observationally indistinguishable from General Relativity in the weak field, low velocity 
regime of post-Newtonian gravitational physics. 

Rosen's theory fails, however, when its predictions for the emission of gravitational 
waves are compared to observations of binary pulsars. Will and Eardley found that unless 
the binary system under consideration obeys very specific properties, in terms of masses 
and mass differences, then Rosen's theory leads to the emission of a large amount of dipole 
gravitational radiation [1275] . This in turn results in a sizeable increase in the orbital 
period of the system, which is not observed. Binary pulsar observations are therefore 
incompatible with this theory. Rosen later proposed replacing flat space metric by an 
a priori specified, but time- varying, cosmological background I1069j . Unfortunately this 
does not circumvent the pulsar problem. 

Other bimetric theories that also have been proposed with an additional a priori 
specified, non-dynamical metric field. These include Rastall's theory [1055, 105(d] and 
Lightman and Lee's theory [808] . for which the PPN limits of both theories are known. It 
has been conjectured by Will, however, that all such theories that incorporate prior spec- 
ified geometry could suffer the same deficiency as Rosen's, when it comes to calculating 
the emission of gravitational radiation from binary systems |1274) . 



19 Note that this equation can be derived from an action principle by including a space-time dependent 
rank-2 tensor as a Lagrange multiplier. We will not go into the details of how to do this here. 
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3.3.2. Drummond's theory 

Let us now turn to a more recent formulation of the bimetric theory. In [434] it 
was proposed to work in the vierbein formulation with g a p = rjAB&a e ^ an d 9a@ = 
flfl^a^- I n this case both sets of vierbein are dynamical. While is used to construct 
the Einstein-Hilbert action, is used to construct the action from which the energy- 
momentum tensor is derived. The missing pieces of the theory are then a transformation 
tensor, Mg, and a scalar, 0, which relate and e^ by 

ej = e*M A B el (194) 
Finally, we then need to define a "linking action" . This is given by Drummond as 

+ 16^ / ^V^r^d^d^) 



16ttG 2 

where the current is defined as 



16ttGi J v 3 4 

' ' d^xy^m 2 ^ 2 , (195) 



^ S = .9 SC [V M M A B ]M S (5 , (196) 

and where Gi and G2 are new gravitational constants, m is a mass parameter, and 7 is 
a free parameter which corresponds to the cosmological constant. Note that the action 
for M is similar to that of the non-linear sigma model found in meson physics. 

Drummond has shown that his bimetric theory has a well defined Newtonian limit 
and so, in principle, can satisfy the time delay measurements from radio signals. He also 
claims that the higher order correction is exactly what is needed to satisfy Solar System 
constraints from the precession of the orbit of Mercury, and that in the weak field limit 
the dynamical metric g a p ~ r\ a p + h a p gives rise to a potential of the form 

GM ( X + ^ e -mr^ (197) 

Hence, for mr 1 the effective Newton's constant is Gn = G + Gi, while for large scales 
Gjv — G. Such a correction can alleviate the problem of flat galactic rotation curves that 
arises in standard Newtonian gravity with no dark matter, but does not completely 
resolve it. Albeit an intriguing proposal for a theory of modified gravity, there has been 
little progress in studying the various astrophysical and cosmological consequences of 
Drummond's theory. 

3.3.3. Massive gravity 

The theory of a single massive spin-2 field can also be considered as a bimetric theory, 
with a non-dynamical background metric g a p and a dynamical fluctuation given by g a p = 
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g a /3 + h a R- Taking the background to be Minkowski space, for simplicity, we can then 
generate a mass for the spin-2 field h a R by adding the Pauli-Fierz (PF) term to the 
Einstein-Hilbert action [504) . resulting in 

Spf = ^ J d A x^R + ^V^g [g^g ap - ^V"] Vm, (198) 

where m is a constant mass parameter. It is well known that, in four dimensions, a 
massive spin-2 field ought to have five propagating degrees of freedom: Two of hclicity 
2, two of helicity 1, and one of helicity 0. However, a generic mass term with arbitrary 
coefficients will result in higher derivative terms for the helicity-0 mode, giving rise to 
an additional ghost-like degree of freedom. The form of the PF mass term is specifi- 
cally chosen so that this is not the case to linear order. Massive gravity exhibits some 
interesting phenomenology, not least the so-called vDVZ discontinuity, and its possible 
resolution via the Vainshtein mechanism. These will be discussed in more detail in the 
context of DGP gravity in Section [5~5] 

Unfortunately, the PF Lagrangian by itself cannot describe a consistent theory be- 
cause the ghost-like mode reappears at non-linear order |175j . This mode is often referred 
to as the Boulware-Deser ghost, and it was believed that one could not find generalisa- 
tions of the theory that succeeded in eliminating it to all orders 342 . There has, however, 
been some recent progress on this issue by de Rham and Gabadadze and collaborators 
[5T71 EMU [3531 EH1IM5] who have proposed the following action [55211555] : 

Sgpf = [ d i xy/^jR + m 2 y/^jU(g,h), (199) 



16ttG _ 

where U(g, h) = J2n=2 a mS^J 1 • • ■ ^^^-vl ' ' ' ^-vZ an< ^ where the a m are constants, and 

K = Vtf + g» a h av -51= \g^ a h av - X - g ^h afi g^h lv + .... (200) 

It is now clear that the leading order part of the potential gives the PF mass term upon 
choosing 02 = 2. To study the behaviour of the theory beyond linear order it is convenient 
to restore general coordinate invariance by means of the Stuckelberg trick. To this end 
one can perform the following field redefinition |382[ [64] , 



= M^+^-^d^d^ (201) 

(202) 



M pl A| A 6 3 



where in going from Eq. (2011 to Eq. (202) we have set 4> a = x a — rf^ 'dpir / A3, in 
order to focus on the dynamics of the helicity zero mode. Note that A3 = (M^m 2 ) 1 / 3 



and Mpi = l/^/&-n:G. For the original PF action, Eq. (198), the Boulware-Deser ghost 
reveals itself by expanding the action in terms of h^ u . At zeroth order one finds higher 
derivative terms for 7r that contribute to the equations of motion, indicating the presence 
of the ghostly extra mode. In contrast, the generalised PF action is chosen so that the 
resulting higher derivative terms contribute a total derivative at zeroth order in h^ v . 
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This is a crucial first step in avoiding the extra mode. One can go further and study 
the theory in the so-called decoupling limit, m — ¥ 0, M p i — ¥ oo, and A 3 =constant. After 
some suitable field redefinitions one finds that the theory contains the quintic galileon 
Lag rangian [3 8 2) . Note that it does not reproduce the galileon theory discussed in Section 
exactly, since generically there is mixing with a graviton of the form WX 3 ^, where 



4.4 



X° v is cubic in n. This mixing cannot be eliminated by a local field redefinition and may 
have important phenomenological consequences. In particular, when this coupling comes 
in with a particular sign it can prevent the recovery of GR inside the Vainshtein radius 
around a heavy source [738, 7391 1297] . In any event, one can confidently say that the 
Boulware-Deser ghost does not appear in the decoupling limit. Of course, it is possible 
that this limit corresponds to taking its mass to infinity, and that it will reemerge in the 
full theory. Whether or not this is the case has yet to be established. 

Self- accelerating and self-tuning cosmologies were studied dc Rham and Gabadadze's 
theory in [384 , whilst spherically symmetric solutions have also been considered recently 
[7551 17591 1297] . 

3.3-4- Bigravity 

A class of theories that were first proposed in the 1970s by Isham, Salam, Stradthcc 
|647j . and revisited a few years ago by Kogan [355^ and collaborators, have recently been 
resurrected by Banados and collaborators [HI [70] (see for |1 150] a short overview). 
Further studies of bi-gravity include weak-field solutions and gravitational waves |140j , 
exact spherically symmetric solutions [141] and the energy of black holes 329 . The 
starting point [HI] is an extension of Eddington's affine theory (see section [2]) so that the 
dynamical fields are a metric g^ (with curvature scalar R) and a connection C" v with 
Ricci tensor K^[C]. The action is 

S[g,C] = ^ I d A x^-g{R - 2A) + A yj-detlg^ - Jk^] (203) 



16ttG J v * v ' aP 

where A is a cosmological constant, a is a dimensionless parameter and t is a length scale. 
It may be shown [85] by introducing a 2nd metric g^ corresponding to the connection 
C^„ that the above theory is a special case of bigravity with action given by 



(204) 



where A = is a cosmological constant term. In these theories, both metrics are used 
to build Einstein-Hilbert actions even though only one of them couples to the matter 
content. 

Such bigravity theories lead to interesting behaviour on cosmological scales [SSJ [70] . 
The homogeneous and isotropic FLRW metrics can be written as g a p = diag(—l, a 2 , a 2 , a 2 ) 
and g a /3 = diag(— X 2 , Y 2 , Y 2 , Y 2 ), where X and Y are functions of t alone. The corre- 
sponding Fricdmann equations are then of the form 

H 2 = 8 -^f(p + p), (205) 

where p = Y 3 /(8irG£ 2 Xa 3 ). This fluid satisfies a conventional conservation equation of 
the form 

g = -3(l + t2)p, (206) 
79 



where w, satisfies a somewhat intricate evolution equation, given by 



dw 

— = 2w 
dt 



l + 3u,+ W4(-^)3/^a-2 (1 + 3 ^ £ 



(207) 



where p c = p + p, pi = (SirGi 2 )^ 1 and 17 = /5//3 c . The extra metric here can lead to a 
range of interesting behaviours and, in particular, can drive the expansion to a de Sitter 
phase, or mimic the effects of dark matter. Anisotropic universes in these models were 
studied in [1066] . 

The cosmological evolution of perturbations in these theories has been worked out 
in some detail, and it turns out that the perturbations in the auxiliary field can be 
rewritten in the form of a generalised dark matter fluid [626] with density, momentum, 
pressure and shear that obey evolution equations. As a result, it is possible to work out 
cosmological observables such as CMB anisotropics and large-scale structure. In [70] it 
was found that distinctive signatures emerge during periods of accelerated expansion. If 
the p field dominates, and is responsible for cosmic acceleration, there is a clear instability 
in the gravitational potentials; they not only grow but diverge leading very rapidly to an 
overwhelming integrated Sachs- Wolfe effect on large scales. It is difficult to reconcile the 
angular power spectrum of fluctuations and the power spectrum of the galaxy distribution 
predicted by a bimetric theory that unifies the dark sector with current data. If we restrict 
ourselves to a regime in which p simply behaves as dark matter, however, then the best-fit 
bimetric model is entirely indistinguishable from the standard CDM scenario. 

Bigravity theory can also be extended to consider more complicated actions, such as 

1 t *, 



Sl = J ?xy/-g [K (~g-T"9«f> + Ki((FT'Wr + K2(~g-T P (r%p\ ,(208) 

and, although a full analysis of its PPN parameters has been undertaken |308| . its cosmol- 
ogy remains to be explored. Black holes, and their thermodynamics, have been studied 
in bimetric gravity in [71] . 

3.3.5. Bimetric MOND 

A bimetric theory of MOND, somewhat akin to bigravity, has recently been proposed 
by Milgrom [896]. The action for bimetric MOND, or BIMOND, is of the form 

S = 16ttG 



/ d*x \fiy/^gR + ayf^g~R- 2{gg) 1/4 a 2 M 
ttG J L 1 J 

SM(9nv,i>) ~ S M (g^,ip) (209) 



where M is the interaction term that connects the two metrics, and ip and ip are the 
matter fields that couple to g^ and g^ u , respectively. The factor M is a non- linear 
function of the tensor T^^, given by 

T^ = C a lifl C f, va -C a lu ,C a aa , (210) 

where 

cV = rV- r V> (2ii) 
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and r Q Ml/ and f a ^ are the Christoffel symbols constructed from g^ v and g p „ , respectively. 
Note that the even though the Ts are not tensors, C, constructed in this way, is a tensor. 
The constants a and j3 can be kept unrelated, leading to different gravitational couplings 
in the two sectors. If we set a = j3, however, but leave /3 arbitrary, then we get the field 
equations 

PG/xu + V = -8^GT M „ (212) 

and 

PG^ + S^^-torGf^, (213) 

which look like the conventional Einstein equations, except for the contributions from 
Spiv and which contain the interaction terms between the two metrics. These tensors 
are quadratic in and are non- linear functions of g, g and g^g^v 

This theory has been constructed to reproduce MOND phenomenology on small 
scales, in the weak field and low acceleration regime. Its cosmological implications have 
been studied in {3151 1897j , where it was shown that in the high acceleration regime BI- 
MOND reproduces conventional FLRW behaviour. In low acceleration regime, however, 
we have that the scale factor a(r) (where r is conformal time) can take the form the 
form a ~ r p , where [315] 

1 — 3ii) , . 

P=7 ~ : zr- ( 214 ) 

(1 + 2w + 8nw — 3w — 8nw — 6ww) 

and where w and w are the equations of state for the matter coupled to g^ and g^, and 
n is one of the parameters in A4. This leads to an interesting range of behaviours. For 
example, it is possible to have a dust filled universe that is static, if the matter coupled 



to the second metric is radiation. It is shown in |897j that to calculate fluctuations about 
an FLRW background in either metric requires a knowledge of the matter coupled to 
both metrics. It is also shown that the growth of fluctuations does not proceed in a 
purely Newtonian way, but has a MOND contribution as well. 

3.4- Tens or- Vector- Scalar Theories 

In General Relativity, the space-time metric g^ v is the sole dynamical agent of gravity. 
We have seen above that scalar-tensor theories extend this by adding a scalar field that 
mediates a spin-0 gravitational interaction, while in Einstein-aether theories one makes 
use of a vector field. TeVeS has both of these types of fields as extra degrees of freedom: A 
scalar field, <fr, and a (dual) vector field, both of which participate in the gravitational 
sector. Like GR, it obeys the Einstein equivalence principle, but unlike GR it violates 
the strong equivalence principle. 

TeVeS is a product of past antecedent theories, namely the Aquadratic Lagrangian 
theory of gravity (AQUAL) and its relativistic version 125 , the phase-coupling gravita- 
tion |126| . the disformal relativistic scalar field theory [130J, and the Sanders' stratified 
vector field theory |KT93J. Since its inception |128j TeVeS has been intensively researched, 
including studies of cosmology [HH1 15221 EMI SH SSI HUB E3 [Ml HUE I5D2] . 
spherically symmetric solutions [1281 15351 16601 110821 17741 11154) , gravitational collapse 
and stability [TT241 l33i"] . solar system tests [1281 ESS Q2H [TT981 [T08T] . gravitational 
lensing [TM1 HS3 HHi H5D71 ITSMl 12571 IHl IT39T1 111251 12~M1 1570] . issues of superlumi- 
nality [205] , and the travel time of gravitational waves [664, 6631 1414j . A thorough and 
up-to-date review of TeVeS can be found in |1152j . Here we will concentrate mostly on 
cosmological features of the theory. 
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3.4-1- Actions and field equations 

The original and most common way to specify TeVeS is to write the action in a mixed 
frame. That is, we write the action in the "Bekenstein frame" for the gravitational fields, 
and in the 'physical frame', for the matter fields. In this way we ensure that the Einstein 
equivalence principle is obeyed. The three gravitational fields are the metric, (with 
connection V Q ), that we refer to as the Bekenstein metric, the Sanders vector field, A^, 
and the scalar field, (f>. To ensure that the Einstein equivalence principle is obeyed, we 
write the action for all matter fields using a single 'physical metric', g^ (with connection 
V M ), that we call the 'universally coupled metric' p°| The universally coupled metric is 
algebraically defined via a disformal relation |127| as 

9»u = e-^g^ - 2 sinh(20)A^„. (215) 

The vector field is further enforced to be unit-time-like with respect to the Bekenstein 
metric, i.e. 

gTApK = -1. (216) 
The unit-time-like constraint is a phenomenological requirement for the theory to give 



an acceptable amount of light bending. Using the unit-time-like constraint, Eq. (2161, 
it can be shown that the disformal transformation for the inverse metric is 

g ^ = e 2</>~^ + 2 s inh(2<£) jIM", (217) 

where A^ = g^ v A v . The existence of a scalar and a vector field may seem odd at first, 
but they are both the product of a series of extensions from older theories, based on 
theoretical and phenomenological constraints. 



Actions 



TeVeS is based on an action, S, which is split as 

S = S s + S A + S + S m , (218) 

where S g , Sa, and S m are the actions for g^, the vector field, A^, the scalar field, 
(j>, and matter fields, respectively. 

As already discussed, the action for g^ u , A^ , and (f> is written using only the Bekenstein 
metric, g^, and not g^ Vl and is such that Sg is of Einstein-Hubert form 

where g and R are the determinant and scalar curvature of g^„, respectively, while G 
is the bare gravitational constant. The relation between G and the measured value of 
Newton's constant, Gn, will be elaborated on below, in Section [3. 4. 2| 



20 Some work on TeVeS, including the original articles by Sanders |1093j and Bekenstein 128], refer 
to the Bekenstein frame metric as the "geometric metric", and denote it as g^v, while the universally 
coupled metric is referred to as the "physical metric", and is denoted by Since it is more common 

to denote the metric which universally couples to matter as g^ v , in this review we interchange the tilde. 
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The action for the vector field, A„, is given by 



Sa = - / d 4 x [KF^F^ - 2\(A fi A» + 1)] , (220) 

where F^ v — V ^Ay — V leads to a Maxwellian kinetic term, A is a Lagrange multiplier 
that ensures the unit-timelike constraint on A^, and K is a dimcnsionlcss constant. 
Indices on F^ v are moved using the Bekenstein metric, i.e. F^ v = g^ a F av . This form 
of a vector field action has been considered by Dirac as a way of incorporating electrons 
into the electromagnetic potential 424, 425, 426J. More recently it has been considered 



as a natural generalisation of GR, in the Einstein-aether theories discussed in Section 3.2 
[6561 [655]. 

The action for the scalar field, cf>, is given by 



50 ~ \tm-c: 



^ J d 4 x^g [fx g^V^V^ + V((i)] , (221) 

where fi is a non-dynamical dimensionless scalar field, g^ u is a new metric defined by 

g"" = g" v -A"A U , (222) 

and V(/i) is an arbitrary function which typically depends on a scale, Ib- Not all choices 
of V([i) give the correct Newtonian or MONDian limits in a quasi-static situation. The 
allowed choices are presented in Section |3.4.2| The metric g^ u is used in the scalar field 
action, rather than g^" , to avoid the superluminal propagation of perturbations. Note 
that it is possible to write the TeVeS action using g^ , with the consequence of having 
more general vector field kinetic terms (see the appendix of |1152j ). 

It is also easier in some cases to work with an alternative form for the scalar field 
action that does not have the non-dynamical field, /i, but rather has the action written 
directly in terms of a non-canonical kinetic term for <f> given by a free function f(X), 
with X defined by 

X = l 2 B g» v XJ^ v (j>. (223) 

The field /i is then given in terms of f(X) by fi = while f{X) can be related to V 
by f = fxX + £ 2 B V. 

The matter fields in the action are coupled only to the 'universally coupled metric', 
g^v, and thus their action is of the form 

S m [g,X A ,V X A } = J d i x^gL[g lX A ^X A }, (224) 

for some generic collection of matter fields, x A ■ The matter stress-energy tensor is then 
defined with respect to SS m in the usual way. 

It should be stressed that the action for the scalar field has been constructed such 
that the theory has a MONDian non-relativistic limit, under the right conditions, for 
specific choices of functions V{n) (or equivalently F(X)). The action for the vector field 
has no particular significance other than the fact that it is simple. More general actions 
can be considered without destroying the MOND limit, but that in addition provide new 
features or improved phenomenology. 
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The field equations 



The field equations of TeVeS are found using a variational principle. This gives two 
constraint equations, namely the unit-timelike constraint, given in Eq. (216), and the 
/i-constraint: 

<*»> 



that is used to find fi as a function of V M 0. The field equations for are given by 
Gp V = 8tvG [Tfj, v + 2(1 — e~^)A a T a ^A v )] 



+K 



V„0V„0 - 2,4 Q V a </> A (ll V v) <t>\ + - (p,V - V) fa 
F a .F n „ - l F al3 F, 



(226) 



where G^ is the Einstein tensor constructed from g^ v . The field equations for the vector 



field, Ap, are 



and the field equation for the scalar field, <p, is 



3vrG(l - e-**)A v T utl , 



87rGe~ 20 [g^ + 2e" 2< ^ 'A" ~A V ] T^. 



(227) 



(228) 



The Lagrange multiplier can be solved for by contracting Eq. (227) with A 11 . 

3.4-2. Newtonian and MOND limits 

We now describe the quasi-static limit, as relevant for establishing the existence of the 
Newtonian and MONDian limits. The details of the derivation we consider here can be 
found in |128j , while an alternative shorter derivation in the spirit of the PPN formalism 
is given in |1152j . 

It can be shown that the PPN parameter 7 is unity in TeVeS, hence the universally 
coupled metric can be written as 



ds l 



-(1 + 2>S>)dt 2 + (1 - 2<Z>)5 ij dx i dx j , 



(229) 



where $ is related to the acceleration of particles, a, by a = — V$. The scalar field is 
perturbed as 4> — <\>c + where <f> c is the cosmological value of <f). The Bekenstein metric 
takes a similar form to g^ u : 



d~s 2 = -e -2 *°(l + 2$>)dt 2 + e 2 *°(l - 2$)5 ij dx l dx j . 



(230) 



The vector field does not play a role at this order of perturbations, and is simply given 
by 



A M = e-*»(-l-$, 0,0,0). 
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(231) 



The field equations to 0(v 2 ) are then 



and 



A £ 87rG 



iirGp, 



(232) 



(233) 



while the potential <£> is given via the disformal transformation $ = <& + ip. Eqs. ( 232 ) and 



( 233 1 can be solved for any quasi-static situation, regardless of the boundary conditions 



or the symmetry of the system in question, provided a function //(|Vy|) is supplied. 
To find the Newtonian and MONDian limits we can consider, for simplicity, spher- 



ically symmetric situations. In this case we can combine Eqs. (232 1 and (233) into a 
single equation for $, called the AQUAL equation: 



where 



4ttG n p, 



2G 



2-K 



(234) 



(235) 



The ratio Gn /G is not free, but is found by taking the limit \i m — > 1, i.e. the Newtonian 
limit. Consistency requires that jj, — > which is then a constant^] contained in the 
function / (or V). This gives the relation 



Gn 

~g 



Mo (2 - K) 

The MOND limit is now clearly recovered as fi m ^ v ^"^ , and we get 



(236) 



where X is given in Eq. ( 
find the function f(X), w 



2G_m = 2GJ_ e<t>cV X ) 
Gn a-o Gn ^bo,q 



(237) 



223 1 . Since /i = Mr , we may integrate the above equation to 



rich in the MOND limit should be given by 



2 

31b ao 



j_ 



1 

2-K 



(238) 



where the integration constant has been absorbed into the cosmological constant associ- 
ated with the metric g^. Since both X and / are dimensionless we may define a new 
constant /3q, such that Oq is a derived quantity given by 



a 



2 



1 



1 

2-K 



(239) 



The constant fig is related to the constant k introduced by Bekenstein as /^o = 
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and the function / has the MONDian limit / — > (3qX 3 / 2 . Since in the Newtonian limit 
we have / — > (1qX, there are at least three constants that can appear in f(X), namely 
/i , Po and £ B - 

In terms of the function ^ the MONDian limiting case implies that ^ — > — 9 ^a £ a /i 2 
as /i — > 0, while it diverging as fi — ¥ /io in the Newtonian limit. This second limit is 
imposed if ^ — > (/io — M) _m ) f° r some constant m. Bekenstein chooses this to be m = 1, 
although other choices are equally valid, even functions that have essential singularities. 



It is clear from Eq. ( 239 1 that ao depends on the cosmological boundary condition, 
(j) c , which can differ for each system, depending on when it was formed. It could thus be 
considered as a slowly varying function of time. This possibility has been investigated 
by Bekenstein and Sagi |129j . and by Limbach et al. |810j . 

The two limiting cases for f(X) are somewhat strange. In particular we require that 
f(X) — > X for X > 1 to recover the Newtonian limit, and that f{X) — > X 3 / 2 for 
I < 1 (i.e. a higher power) to recover the MONDian limit. This signifies that in this 
kind of formulation of relativistic MOND (i.e. in terms of a scalar field) the function 
f(X) should be non-analytic. It further signifies that f{X) can be expanded in positive 
powers of \[X for small X, and in positive powers of for large X, but that these 
two expansions cannot be connected. In other words, it is impossible to perturbatively 
connect the Newtonian regime with the MONDian regime via a perturbation series in 
|V^|. 

The Bekenstein free function in |128j is given in the notation used in this review by 

dV _ 3 /i 2 (//-2/i ) 2 



dfi 32tt£ 2 b (iI (/i - /i) 



(240) 



which means that /?o = § \ 7 g*° , and thus 



a = 1 • (-'-Ill 



2-K 



This is in agreement with |129) (the authors of |177j erroneously inverted a fraction in 
their definition of ao). 



3.4-3. Homogeneous and isotropic cosmology 

Homogeneous and isotropic Friedmann-Lemaitre-Robertson- Walker (FLRW) solu- 
tions to the field equations of TeVeS have been extensively studied [1281 I592( 111531 
EH gUl [Tim [T771 IT3U51 ITX381 151121 . In this case the universally coupled metric can be 
written in the conventional synchronous form as 

ds 2 = -dt 2 + a 2 (t) qij dx l dx j , (242) 

where a(t) is the 'physical scale factor'. Here we assume for simplicity that the hyper- 
surfaces of constant t are spatially flat (see [1147] for the curved case). The Bekenstein 
metric then has a similar form, and can be written as 



ds 2 = -dP + b 2 (i)q ij dx i dx j , 
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(243) 



for a second scale factor b(t). The disformal transformation relates the two scale factors 
by a = be~^, while the two time coordinates t and i are related by dt = e^dt. The 
physical Hubble parameter is defined as usual as by H = ^, while the Bekenstein frame 
Hubble parameter is H = ^^e^H + ^t. Cosmological evolution is governed by the 
analogue of the Friedmann equation: 

ZH 2 = 87rGe- 2 ^ { P<f> + p) , (244) 

where p is the energy density of physical matter, which obeys the energy conservation 
equation with respect to the universally coupled metric, and where the scalar field energy 
density, p^, is given by 

dV 



rv 16^G V dp, 
Similarly, one can define a scalar field pressure by 



p— +V). (245) 



The scalar field evolves according to the two differential equations: 

d t = -±T, (247) 

and 

-= + 3HT = 8irGe- 2 4> ( p + 3P) , (248) 
dt 

where p is found by inverting <j>' 2 — |^p- 

It is important to note that the vector field must point to the time direction, so that 
it can be written as = (VsoOj 0)- In this case it does not contain any independent 
dynamical information, and it does not explicitly contribute to the energy density. Its 
only effect is on the disformal transformation which relates the Bckcnstcin-framc Fried- 



mann equation, Eq. (244), with the physical Friedmann equation. This is also true in 
cases where the vector field action is generalised, and where the only effect is a constant 
rescaling of the left-hand-side of the Bekenstein-frame Friedmann equation, as discussed 

in [rngj . 

FLRW solutions with the Bekenstein function 

In the original TeVeS paper |128j Bekenstein studied the cosmological evolution of 



an FLRW universe by assuming that the free function is given by Eq. ( 240 ) . He showed 
that the scalar field contribution to the Friedmann equation is very small, and that <J) 
evolves very little from the early universe until today. He noted that with this choice of 
function, a cosmological constant term has to be added in order to have an accelerating 
expansion today, as appears to be required by cosmological observations. 

Many other studies on cosmology in TeVeS have also used the Bekenstein function, 
see for example |592[ I1153[ I427[ 11148] . In particular, Hao and Akhoury noted that the 



integration constant obtained by integrating Eq. (240) can be used to get a period 
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of accelerating expansion, and that TeVeS therefore has the potential to act as dark 
energy |592j . However, such an integration constant cannot be distinguished from a bare 
cosmological constant term in the Bekenstein frame, and so it is somewhat dubious as to 
whether this can really be interpreted as dark energy arising from TeVeS. Nevertheless, it 
would not be a surprising result if some other TeVeS functions could, in fact account for 
dark energy, as the scalar field action in TeVeS close resemblances that of k-essence [65l 
[55] , Zhao has investigated this issue further [1305] fsee below). 



Exact analytical and numerical solutions with the Bekenstein free function, Eq. ( 240 1 , 
have been found by Skordis et al.vn |1153| . and by Dodelson and Liguori in |427j . It turns 
out that not only, is the scalar field is subdominant, as Bekenstein noted, but its energy 
density also tracks the matter fluid energy density. The ratio of the energy density in 
the scalar field to that of ordinary matter then remains approximately constant, so that 
the scalar field tracks the matter dynamics. One then gets that 

- in-- < 24S » 

6(l-w) 2 /i 

where w is the equation of state of the matter fields^} Since ^ is required to be very 
large, the energy density in the scalar field is always small, with values typically less than 

~ 10 -3 in a realistic situation. Tracker solutions are also present for this choice of 
function in versions of TeVeS with more general vector field actions |1148j . 

In realistic situations, tracking in the radiation era is almost never realised, as has 
been noted by Dodelson and Liguori [427] . Rather, during the radiation era, the scalar 
field energy density is subdominant but slowly growing, such that <fi °^ a 4 / 5 . However, 
upon entering the matter era <f> settles into the tracker solution. This transient solution 
in the radiation era has been generalised by Skordis to arbitrary initial conditions for 
<j>, more general free functions (see below), and a general vector field action |1148j . It 
should be stressed that the solution in the radiation era is important for setting up initial 
conditions for the perturbations about FLRW solutions that are relevant for studying 
the CMB radiation and Large-Scale Structure (LSS). 



From Eq. ( 239 ) we see that ao for a quasi-static system depends on the cosmological 
value of the scalar field at the time the system broke off from the expansion, and collapsed 
to form a bound structure. It is then possible that different systems could exhibit different 
values of ao depending on when they formed. The impact of evolving ao on observations 
has been investigated in |129l 1810] . 

Finally, note that the sign of </> changes between the matter and cosmological constant 
eras. In doing so, the energy density of the scalar field goes momentarily through zero, 
since it is purely kinetic and vanishes for zero 4> [1153 . 

FLRW solutions by generalising the Bekenstein function 

Bourliot et al. |177j studied more general free functions, that have the Bekenstein 
function as a special case. In particular they introduced two new parameters, a constant, 



22 Note that this excludes the case of a stiff fluid with w = 1. 



,X,X 



fi ai and a power index, n, such that the free function is generalised to 



dV (n) = 3 M 2 ( M - Ma /i )" 
<izx 327r£ 2 j/x 2 1 /xq — 

This function^] reduces to the Bekenstein function when n — 2 and /i a — 2. It retains 
the property of having a Newtonian limit as /i — > fiQ and a MOND limit as \i — > 0. The 
cosmological evolution depends on the power index, n. More general functions can also 
be constructed by considering the sum of the above prototypical function with arbitrary 
coefficients, i.e. by taking <g = £„ c n ^ [177] . 

Clearly ^(fi a fi ) = 0, and at this point <p —> 0. Now suppose that the integration 
constant is chosen such that V([i a [i ) = as well. Then, just like the case of the 
Bekenstein function, one finds tracker solutions: The function // is driven to fx = //„//(). 
at which point = 0. There are no oscillations around that point, but it is approached 
slowly so that it is exactly reached only in the infinite future. The scalar field relative 
density is now given by 

(l + 3u,) 2 
3/i a (l - wj-^o 

independent of the value of n. It should also be pointed out that the evolution of the 
physical Hubble parameter, H , can be different than the case of GR even in the tracking 
phase |177j . For example in the case » = 0we have H cx a~ nh , where rih = 2( t ff°^n • 

Furthermore, just like the Bekenstein case, the radiation era tracker is untenable 
for realistic cosmological evolutions, for which /io must be large so that is small 
(< 10~ 3 ). In this case we once again get a transient solution where the scalar field 
evolves as oc a 4 ^ 3 " 1 ""-' |1148| . In the case that the integration constant is chosen 
such that V(fj, a fio) ^ one has an effective cosmological constant present. Thus, once 
again, we get tracker solutions until the energy density of the Universe drops to values 
comparable with this cosmological constant, at which time tracking comes to an end, 
and the Universe enters a de Sitter phase. 

The cases — 2 < n < turn out to be pathological as they lead to singularities in the 
cosmological evolution [177] . The case n = —3 is well behaved when the matter fluid is a 
cosmological constant, but is also pathological when w = — 1 |177j . The cases for which 
n < —4 are well behaved in the sense that no singularities occur in the cosmological 
evolution. Contrary to the n > 1 cases, the cosmological evolution drives the function 
H to infinity. Moreover, these cases do not display the tracker solutions of n > 1, but 
rather the evolution of p§ is such that it evolves more rapidly than the matter density, 
p, and so quickly becomes subdominant. The general relativistic Friedmann equation is 
thus recovered, such that 3H 2 — 8nGp. This also results in H = H, which means that 
the scalar field is slowly rolling. 

The evolution of the scalar field variables T, (f> and /i then depends on the equation 
of state of the matter fields. If the background fluid is a cosmological constant, then we 
get de Sitter solutions for both metrics, and it can be shown that T — 2H(e~ 3Ht — 1). 



23 Note that [1 77] uses a different normalisation for V, and their results can be recovered by rescaling 
the ig used in this report by £g — > t-B\j i/^o" 3 ' 
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For the case of a stiff fluid, with w = 1, we get that T has power-law solutions that are 
inverse powers of t, so that T = | + A similar situation arises when — 1 < w < 1, for 

which we get V = i? , and the Hubble parameter evolves as H = gn^T) j- Notice 

that the limit it) — > 1, for the — 1 < u> < 1 case, does not smoothly approach the w = 1 
case. 

Mixing different powers of n > 1 leads once again to tracker solutions. One may 
have to add an integration constant in order to keep V(p a pa) — 0, although for certain 
combinations of powers n and coefficients c% this is not necessary. Mixing n — with some 
other n > 1 cannot remove the pathological situation associated with the n = case. 
Mixing n = with both positive and negative powers could however lead to acceptable 
cosmological evolution since the effect of the negative power is to drive p away from the 
p = p a po point. In general, if we mix an arbitrary number of positive and negative 
powers we get tracker solutions provided we can expand the new function in positive 
definite powers of (p — p' a po), where p' a is some number different from the old p a . 

The observational consequences for the CMB and LSS have not been investigated for 
this class of function, unlike the case of the Bekenstein function. 

Inflationary /accelerating expansion for general functions 

Diaz-Rivera, Samushia and Ratra [4 19) have studied cases where TeVeS leads to 
inflationary, or self-accelerating, solutions. They first consider the vacuum case, in which 
they find that de Sitter solutions exist with b ~ e Hot , where the Bekenstein frame Hubble 

constant Ho is given by the free function as Hq = y ^p, and where ^ = (i.e. the 
scalar field is constant, <f> = (pi). Such a solution will always exist in vacuum provided 
that the free function satisfies 7^7 (Mi>) — an d V(jh>) 0) f° r some constant fi v . In that 
case, the general solution is not de Sitter since both <fi and fi will be time-varying, but 
will tend to de Sitter as /1 — > /i v . Indeed, the n > 1 case of Bourliot et al. [177] with an 
integration constant is precisely this kind of situation. 

In the non-vacuum case, for a fluid with equation of state P — wp, they make 
the ansatz 6 3 ( 1+lu ) = e ( 1 +3™)0 > This brings the Friedmann equation into the form 
3H 2 = 8irGpo + 5(^^)7 + V), where po is the matter density today. Once again, they 
assume that the free-function-dependent general solution drives p to a constant p v , but 
4> is evolving. Thus, we must have that (f> = <f>\t + 2 j such that <f> = <pi is a constant. In 
order for <f>\ to be non-zero we must have ^(Pv) 7^ 0. However, there is a drawback to 
this approach. As they point out, consistency with the scalar field equation requires that 
w < — 1. Furthermore, although this solution is a de Sitter solution in the Bekenstein- 
frame, it corresponds to a power-law solution for the universally coupled metric. In order 
for this power-law solution to lead to acceleration, they find that —5/3 < w < —1. This 
range of w corresponds to a phantom fluid. 

Accelerated expansion in TeVeS 

The simplest case of accelerated expansion in TeVeS is provided by a cosmological 
constant term. This is equivalent to adding an integration constant to V(p) |592l 1177] . 
and it corresponds to the accelerated expansion considered by Diaz-Rivera, Samushia 
and Ratra 419 in both the vacuum and non- vacuum cases (see above). These solutions 
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therefore suffer from the usual fine-tuning and coincidence problems, and so it is of 
interest to look for accelerating solutions without such a constant, simply by employing 
the scalar field (these need not be de Sitter solutions). 

Zhao used a function ^ oc /i 2 to obtain solutions which provide acceleration, and 
compared his solution with the SNla data |1305j . finding good agreement. However, it 
is not clear whether other observables, such as the CMB angular power spectrum, or 
observations of LSS are compatible with this function. Furthermore, this function is not 
realistic, as it does not have a Newtonian limit (it is always MONDian). Although no 
further studies of accelerated expansion in TeVeS have been performed, it is plausible 
to think that certain choices of function could lead to acceleration. This is because the 
scalar field action has the same form as a k-essence/k-inflation action [551 ISSJi which 
has been considered as a candidate theory for acceleration. More precisely, the system 
of cosmological equations corresponds to k-essence coupled to matter. It is not known 
in general whether this type of model has similar features as the uncoupled k-essence 
models, although Zhao's study indicates that this a possibility. 

Realistic FLRW cosmology 

In TeVeS, cold dark matter is absent. Therefore in order to get acceptable values 
for the physical Hubble constant today (i.e. around H Q ~ 70 km s _1 Mpc -1 ) , we have 
to supplement the absence of CDM with something else. The reason for this is simply 
that if all the energy density in the Universe today was in the form of baryons, then the 
Hubble constant would be lower than what is observed by a factor of ~5. Possibilities 
of what this supplementary material could be include the scalar field itself, massive 
neutrinos 11531 I5U2"] . and a cosmological constant. At the same time, one has to get 
the right angular diameter distance to recombination |502j . These two requirements can 
place severe constraints on the allowed form of the free functions. 

3.4-4- Cosmological perturbation theory 

Cosmological perturbation theory in TeVeS has been formulated to linear order in 
|1147j . and in variants of TeVeS with more general vector field actions in [1148 . The 
scalar modes of the linearly perturbed universally coupled metric are given in the con- 
formal Newtonian gauge, as usual, by 

ds 2 = -a 2 (l + 2*)dr 2 + a 2 (l - 2$)q ij dx i dx j , (252) 

where r is conformal time, defined by dt = adr. Here, we will assume, for simplicity, 
that the spatial curvature is zero. The reader is referred to [1147[ 11148 for the curved 
cases, as well as for an enunciation of vector and tensor perturbations. The scalar field 
is perturbed as <j> — <j> + ip, where <t> is the FLRW background scalar field, and ip is the 
perturbation. The vector field is perturbed as — ae~^(l + * — <p, V,a), such that 
the unit-timelike constraint is satisfied. This removes the time component of as an 
independent dynamical degree of freedom. Thus, there are two additional dynamical 
degrees of freedom, when comparing to cosmological perturbation theory in GR: The 
scalar field perturbation, ip, and the vector field scalar mode, a. 

The perturbed field equations for the scalar modes can be found in the conformal 
Newtonian gauge in J153J, and in more form (including in the synchronous gauge) 
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in |1147j . Perturbation equations for more general TeVeS actions are given in |1148j . 
Here we only present the Newtonian gauge equations of the original TeVeS formulation. 
We define the following variables: $ = $ — <p, iff = if> — <p : £ = — (1- e~ 4 ^)a and H=\. 
The scalar field obeys the two first order equations 



-3^7 +-e' 3 h 2 (<p + fia) 
a 



[6$' + 2fc 2 C 



SirGae 



-30 



S Pf + 3SP f + J2(Pf + 3P/) (* - 2^) 
/ / 



and 



The vector field equations are given by 
K 



(E' + HE) = -fitffa - (j>'a) + 8irGa 2 {l - e" 4 *) ^(p f + P f )(0 f - a) 

f 



and 



a' = E + * + (4>' - H) a 
and finally the Einstein equations are given by the Hamiltonian constraint 



-2 (fc 2 - 3k) $ - 2e^H [3$' + fc 2 C + 

—Kk 2 E = 87rGa 2 p f [S f - 2tp] 
f 



the momentum constraint equation 



& + n( + H^ - p<f>'ip = 47rGa 2 e- 4 ^ ^(p/ + P/)^ 

/ 



and the two propagation equations 



6$" + 2k 



((' - e _4< ^ + 2e~^ (fc 2 - 3k) $ + 2H 6$' + 3#' + 2fc 2 ( 



[3$' + fc 2 Cj + 3^«T Vt + 6 2H' + H 2 + Acjy'H 



2AirGa 2 e- 4 ' p (SP - 2Pip) 
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Let us now turn to the problem of specifying initial conditions for the scalar modes, 
which in general should depend on the chosen form of the free function. The exact 
adiabatic growing mode in TeVeS, and generalised variants, have been found by Skordis 
in [1148] , but only for the case of the generalised Bekenstein function. If the free function 
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is such so that the scalar field contribution to the background expansion during the 
radiation era is very small, however, then the adiabatic modes for other free functions 
should be only marginally different from the ones found in [1148] ■ In particular, the only 
effect should be a difference in the initial conditions of tp, which is not expected to make 
any difference to observations. 

The only study that has been performed of the observational signatures of TeVeS in 
the CMB radiation and in LSS is due to Skordis, Mota, Ferreira and Boehm |1153j . Here 
the initial conditions were chosen such that both tp and a, as well as their derivatives, 
are initially zero. While this is not a purely adiabatic initial condition, it turns out that 
it is close enough to ensure that no observable difference can be seen from isocurvature 
contamination. Detailed studies of isocurvature modes in TeVeS have not yet been 
conducted. In the light of the problems that TeVeS has with observations of the CMB 
radiation |1153) . however, it may be important to investigate what effects isocurvature 
modes are likely to have. Preliminary studies by Mota, Ferreira and Skordis have shown 
that setting the vector field perturbations to be large initially can have a significant 
impact [9T0] . 

In addition to the four regular isocurvature modes that exist in GR, there could in 
principle exist four further isocurvature modes in TeVeS: Two associated with the scalar 
field, and two associated with the vector field. Preliminary studies by Skordis have shown 
that none of the scalar field isocurvature modes are regular in either the synchronous 
or conformal Newtonian gauges. Conversely, under certain conditions of the vector field 
parameters one of the vector field isocurvature modes can be regular, while the other one 
is never regular. Thus, it may be possible to have one regular isocurvature mode in the 
TeVeS sector. The observational consequences of this mode are still unknown, as is its 
generation method from early universe inflation. Studies of the observable spectra based 
on vector or tensor modes are also yet to be conducted although the necessary equations 
can be found in pT47lfTT48] . 

3.4-5. Cosmological observations and constraints 

Let us now consider the observational signatures of the perturbation theory discussed 
above, and how they can be used to constrain TeVeS. 

Large-scale structure observations 

A traditional criticism of MOND-type theories was their lack of a dark matter com- 
ponent, and therefore their presumed inability to form large-scale structure compatible 
with current observational data. This criticism was based on intuition formed from a 
general relativistic universe filled with baryons only. In that case it is well known that, 
since baryons are coupled to photons before recombination, they do not have enough 
time to grow into structures on their own. Furthermore, their oscillatory behaviour at 
recombination is preserved, and is visible as large oscillation in the observed galaxy power 
spectrum P gg (k). Finally, on scales smaller than the diffusion damping scale they are 
exponentially suppressed due to Silk damping. Cold dark matter solves all of these prob- 
lems because it does not couple to photons, and therefore can start creating potential 
wells early on in the Universe's history, into which the baryons can fall. This is enough 
to generate the right amount of structure, erase most of the oscillations, and overcome 
the Silk damping. 
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TeVeS contains two additional fields, not present in GR, that change the structure of 
the equations significantly. The first study of large-scale structure observations in TeVeS 
was conducted by Skordis, Mota, Ferreira and Bcehm in [1153] . Here the perturbed 
TeVeS equations were solved numerically for the case of the Bekenstein function, and 
the effects on the matter power spectrum, P(k), were determined. It was found that 
TeVeS can indeed form large-scale structure compatible with observations, depending 
on the choice of TeVeS parameters in the free function. In fact, the form of the matter 
power spectrum, P(k), in TeVeS looks quite similar to the corresponding spectrum in 
ACDM. Thus, one has to turn to other observables to distinguish the TeVeS from General 
Relativity. 

Dodelson and Liguori 427 provided an analytical explanation of the growth of struc- 
ture that was found numerically in |1153j . They concluded the growth in TeVeS cannot 
be due to the scalar field, as the scalar field perturbations are Bessel functions that decay- 
ing in an oscillatory fashion. Instead, they reasoned, the growth of large-scale structure 
in TeVeS is due to the vector field perturbation. 

Let us see how the vector field leads to growth. Using the tracker solutions in the 
matter era, from Bourliot et al. |177j . we can find the behaviour of the background 
functions a, b and cf>. Using these in the perturbed field equations, after setting the 
scalar field perturbations to zero, it can be shown that in the matter era the vector field 
perturbation a obeys the equation 



a -i a - 

T 

in the conformal Newtonian gauge, where 



bi 
b 2 



MoA*a + 3 
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(M0A*a + 3) 2 
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5 + K ' MoMa 
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(261) 

(262) 
(263) 



and where S is a source term which does not explicitly depend on a. If we simultaneously 
take the limits [1q — > oo and K —> 0, for which f2<^ — > 0, meaning that the TeVeS 
contribution is absent, then we get b\ — > 4 and b 2 — > 2. In this case the two homogeneous 
solutions to Eq. (261) are t~ 2 and r _1 , which are decaying. 

Dodelson and Liguori show that the source term S(^>, Vt, 9) is not sufficient to create 
a growing mode in the general solution to Eq. (261 ), and that in the general relativistic 



limit TeVeS does not, therefore, provide enough growth for structure formation. Now let 



us consider the general case. Assuming that the homogeneous solutions to (261) can be 
written as r™, it can be shown that for the generalised Bekenstein function of Bourliot 
et al. |177j we can get 

' "' (264) 



1 



2 2 V KnoHa 
Thus, we can have n > 0, provided that for fixed fiofi a we also haver 



K < 0.01. 



(265) 



Smaller values of /ioMa can a l so raise this threshold. 
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Figure 3: LEFT: The evolution in redshift of baryon density fluctuations in TeVeS (solid line), and in 
ACDM (dashed line) for a wavenumber k = 10 _3 Mpc _1 . In both cases, the baryon density fluctuates 
before recombination, and grows afterwards. In the case of ACDM, the baryon density eventually follows 
the CDM density fluctuation (dotted line), which starts growing before recombination. In the case of 
TeVeS, the baryons grow due to the potential wells formed by the growing mode in the vector field, a 
(dot-dashed line). RIGHT: The difference of the two gravitational potentials, <E> — for a wavenumber 
k = 10~ 3 Mpc _1 as a function of redshift for both TeVeS (solid line), and ACDM (dotted line). 

If this condition is met then there can exist a growing mode in a, which in turn feeds 
back into the perturbed Einstein equation and sources a non-decaying mode in $ that 
can drive structure formation. This is displayed graphically in the left panel of Figure [3] 
It is a striking result that even if the contribution of the TeVeS fields to the background 
FLRW equations is negligible (~ 10~ 3 or less), one can still get a growing mode that 
drives structure formation. 

CMB observations 

A general relativistic universe dominated by baryons cannot fit the most up to date 
observations of anisotropies in the CMB |984j . This is true even if a cosmological constant 
and/or three massive neutrinos are incorporated into the matter budget, so that the first 
peak of the CMB angular power spectrum is at the right positiorp^] This, however, is not 
proof that only a theory with CDM can fit CMB observations (as claimed in |11551ITT73"] ). 
A prime example to the contrary is the Eddington-Born-Infeld theory |70j . However, the 
linear Boltzmann equation, and the resulting CMB angular power spectrum, have been 
calculated in TeVeS, using initial conditions that are close to adiabatic [1153] . The re- 
sulting fits to the data were poor, at least for the Bekenstein free function, showing that 
CMB observations are, nevertheless, problematic for TeVeS. It may be that the new 
isocurvature modes discussed above can provide a richer phenomenology but it remains 
to be seen whether this can save this theory. 



25 In this case the third peak is unacceptably lower than the second. 
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The difference in the gravitational potentials: $ — 



The result of Dodelson and Liguori [427] have a further direct consequence: The 
perturbation equations in TeVeS that relate the difference of the two gravitational po- 
tentials, $ — V?, to the shear of matter, have additional contributions coming from the 
perturbed vector field, a. This is not due to the existence of the vector field per se, but 
comes from the disformal transformation in which the vector field plays an important 
part. Indeed, in a single metric theory where the vector field action is Maxwellian, as in 
TeVeS, there is no contribution from the vector field to $ — Now, as the vector field 
is required to grow in order to drive structure formation, it will also inevitably lead to 
growth in $ — ty. This is precisely what we see numerically in the right panel of Figure [3] 
If $ _ \jr ca n be measured observationally, then it will provide an excellent test of TeVeS. 
This possibility is discussed in more detail in section [6] 

3.5. Other Theories 

3.5.1. The Einstein- Cartan-Sciama- Kibble Theory 

In this subsection we will briefly describe the Einstein-Cartan-Sciama-Kibble (ECSK) 
theory [2591 [2601 [26T | [TT041 111051 169T] . The ECSK theory is in many cases equivalent to 
General Relativity and departs from GR only when at least one matter field has intrinsic 
spin. The ECSK theory has been reviewed by Hehl et al. [601] , and more recently by 
Trautman [I5T5] . 



The ECSK theory as a theory with torsion 



The ECSK theory is basically General Relativity with the addition of torsion. The 
connection is assumed to be metric compatible, but has non-zero torsion, and is thus 
given by 

^* = {«tf} +JP V (266) 

where K^ a p is the contorsion tensor given in terms of the torsion S ti a ^ by Eq. ( 40 ) . The 
Riemann tensor is antisymmetric in both the first and the last two indices, and hence 
the Ricci tensor is its only unique non-vanishing contraction. It is, however, asymmetric, 
and is given by Eq. (38 1. 

The addition of torsion to the connection has a direct consequence on the geometry 
of curves. In this case, autoparallels (straightest lines) are not necessarily extremals 
(shortest or longest lines) as they are in GR. The former are given by 

j2 jj, j a j p 

^ +r -!!=°- < 267 > 

while the latter are found by minimising the proper length f ^—g^dx^dx", and are 
given by 

Spin-less test particles and gauge-fields (e.g. photons) do not feel the torsion and follow 
the extremals. However, spinning test particles do feel the torsion and obey analogues 
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of the Mathisson-Papapetrou equations 868, 1024J. The notions of autoparallel and ex- 
tremal curves coincide if and only if the torsion is totally antisymmetric. The reader is 
referred to [5991 112141 [8] for further discussion. 



The action and field equations 



The action for the ECSK theory is the same as the one for the metric-affine gravity 
described in Eq. ( 46 1 , with the additional assumption that the connection is metric 



compatible. To make sure that no inconsistency arises one has to impose Q^ a f} — with 
a Lagrange multiplier. Variation with respect to r M Q(9 then proceeds in the usual way. 

We can follow a different approach, however, and assume from the start that Q^ap — 
0, and that the connection is given by Eq. (266). We can then vary the action, Eq. (46 1, 



without the Lagrange constraint, but instead take as independent variables the metric 
and the contorsion. Variation with respect to the inverse metric at constant contorsion 
then gives 



G 



P, 



pa 



where the matter stress-energy tensor is given by T Mt/ = — — 
respect to the contorsion gives 



8C„ 



-g Sgf u lit' 



(269) 



Variation with 



(270) 



where r,, a ° 

H 1 



is the spin angular momentum tensor of matter. Due to 

the anti-symmetry of the contorsion in the first and third indices we also have that 
Tan/3 — -t^q. Note, however, that the metricity assumption also means that the Palatini 
tensor simplifies to 

Pot/JtP = ^/J.a/3 + 2<7 M [ Q! 5 / 3], (271) 



which is also antisymmetric in the first and third indices. Hence no inconsistency arises 
from Eq. (2701, as it does in the general metric-affine case. Now, Eq. (2691 only 



determines the symmetric part of the Einstein tensor, but we also need the anti-symmetric 



part. Using Eqs. (271) and (38) gives the anti-symmetric part of the Einstein tensor as 



Li V ' 



(272) 



2Sfj,. Equations (269), (270) and (272) form a consistent set of field 



where V M = V 
equations for the ECSK theory. 

We can proceed one further step, however, and consider the torsion rather than the 
contorsion as the 2nd independent variable. This leads to the definition of the spin-energy 
potential, [i v a ^ = — j^F 1 . Due to the antisymmetry of the torsion in the last two 

indices, the spin-energy potential also obeys ii ua fi — —^u/3a- It is straightforward to 
show that the spin-energy potential and the spin angular momentum tensors are related 
to each other by 

Tafi/3 = — V{a0]Li (273) 

and 

fJ-Liafj = T>a/3 — Tpfta + T a/ 3^. (274) 
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Carrying out the variation this way defines a new stress-energy tensor, (Tnui a * constant 



S ^a/3 ^ 



SC„ 



This is related to T^ u by 



a 



T — 2 



S. 



a/3 



5V 



a/3 



8a Q _a B 

(n D v)a/3 



(275) 



The field equations obtained from varying the action with respect to the metric are 



+2 



" ^v)aB + " (,. 



a/3 



S, 



a/3 



= 8irGa„ 



(276) 



After some algebra, and using Eq. (272 ) to form the full asymmetric Einstein tensor, we 

(277) 



find that the Einstein equations simplify to 
where 



(278) 



is a new stress-energy tensor. This new stress-energy tensor has a very important inter- 
pretation [601] : It is none other than the canonical stress-energy tensor. Thus, in the 
ECSK theory the usual symmetrisation procedure of the canonical stress-energy tensor 
to obtain the stress-energy tensor that enters the Einstein equations is not necessary. 



Equations (277) and (270) form a consistent set of field equations that determine the 



geometry of the space-time from the matter stress-energy and spin distribution. They 
are supplemented by the conservation laws 



V7 V v — 9V P <? Q 



B\ R a 



and 



V tt r„ 



(279) 



(280) 



Let us now discuss a further property of the ECSK theory. By inspecting of the field 



equations (277) and (270) we notice that there are no derivatives of the torsion appearing 



anywhere. Thus, the torsion in the ECSK theory is non-dynamical. Its presence is 



directly given in terms of the spin-angular momentum of matter by Eq. (270). This 



means that it completely vanishes in vacuum, or in cases where matter does not couple 
to contorsion (e.g. scalar and gauge fields). Since S a „ v is algebraically determined one 
can eliminate it from all of the field equations. The final form of the field equations after 
eliminating torsion is then found to be 

G^u = SttGT^ + (8TrG) 2 ^2T /3a ^T a ^ 13 - 2r a T a ^ - T afl pT a v ^ 

+ i [2r^r afl0 + r a ^r afl0 - 2r a T a ] g^X, (281) 



where G M „ is the Einstein tensor of g M „, obtained in the usual way from the Levi-Civita 
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connection, and — t ^ 



Consequences of the ECSK theory 



We introduced the ECSK theory as a minimal modification of GR through the in- 
troduction of torsion and the application of the Palatini procedure. As was shown by 
Kibble [691) . and then later by Hehl et al. |601j . the ECSK theory emerges as the mini- 
mal description of space-time when one constructs a local gauge theory of the Poincare 
group. There are, however, a number of unsatisfactory features of this interpretation. 
Quantisation of the theory fares no better than in GR. It is still non-renormalizable (the 
spin contact interactions are reminiscent of 4-fermion interaction terms in quantum field 
theory), and the torsion has vanishing canonical momentum, which makes it hard to 
construct a quantum description. The reader is referred to [601] for more details. 

One may then ask when is the ECSK theory different from GR? By inspection, Eq. 



(281 1 tells us that the spin-potential modifications enter with an additional power of the 
Planck mass. Thus, we only expect the ECSK theory to deviate from GR at very high 
spin-densities of matter. For electrons the critical mass density such that spin effects are 
important is ~ 10 38 kg m~ 3 while for neutrons it is ~ 10 45 kg m -3 [602] . Conditions for 
such high densities can exist in the early universe. 

Cosmology with spin and torsion has been studied in the hope that the cosmo- 
logical singularity may be avoided. However, it appears that only under very unre- 
alistic circumstances can one avoid the cosmological singularity in the ECSK theory 
(see [6011 [683 ESSl [757]). 



3.5.2. Scalar- Tens or- Vector Theory 

As a matter of completeness, we will briefly present the Scalar- Tensor- Vector theory 
of gravity (STVG) proposed in [905] . STVG is a theory that contains a vector field, </> M , 
three scalar fields, G, u> and (i, and a metric, g a p. A key characteristic of this theory is 
that the modified acceleration law for weak gravitational fields has a repulsive Yukawa 
force, as well as the normal Newtonian force. The action for STVG takes the form 



S — Sorav + Sd> + Ss 



s 



(282) 



where 



SGrav 

S,p 
S s = 



1 

16tt 



d 4 3 



d 4 x, 



d^X, 



G 



(R + 2A) 



(283) 
(284) 



x v-g 



x v-g 



l 

(P 
l 

G 

1 

Jt?G 



yV„GV u G-V(G) 



(285) 



and where B.,,. = d n 



99 



This action has been studied in [905] where the full field equations, the weak field limit, 
and cosmological solutions are presented. As would be expected from the complexity of 
the action, it is difficult to completely solve this system in the detail required to make 
accurate predictions. Hence, the author of [905] has posited a certain number of scaling 
relations that translate into spatial dependences for G, to and [i. It is argued in [905] 
that the classical action for this theory could be considered as an effective field theory for 
a renormalisation-group-flow-quantum-gravity scenario. The reader is referred to [905] 
for further details. 
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4. Higher Derivative and Non-Local Theories of Gravity 



Recall from Section |2.4.1| that General Relativity represents the most general the- 
ory describing a single metric that in four dimensions has field equations with at most 
second-order derivatives [8311 [832 . One way to extend GR is therefore to permit the 
field equations to be higher than second order. Indeed, such a generalisation might be 
considered desirable as it will cause the graviton propagator to fall off more quickly in the 
UV, thereby improving the renormalisability properties. Modifying gravity in this way, 
however, also has a number of drawbacks. In particular, it can introduce instabilities 



into the theory, such as ghost-like degrees of freedom (see Sections 2.1.3 4.1.5 and 4.2.5 
and |1281j for an overview). 

In this section we consider those gravity theories that are higher than second-order in 
derivatives. Such theories can have interesting phenomenology, and in many cases can be 
shown (or, at least, argued to be) less susceptible to instabilities than one may have ini- 
tially suspected. For example, if the higher derivatives act only on what would otherwise 
be non-dynamical modes, then they may simply render them dynamical, rather than au- 
tomatically generating a ghost. This is what happens in f(R) gravity, where the higher 
order derivatives act on the conformal mode that does not propagate in GR. In Hofava- 
Lifschitz gravity, as another example, one allows for higher-order spatial derivatives, as 
opposed to higher time derivatives, in order to guard against ghost-like instabilities. In 
both of these examples the theory can deviate considerably from GR, while still main- 
taining some basic stability properties. 

This section also includes galileon and ghost condensate theories. Strictly speaking 
these are not higher-derivative theories since their field equations are at most second 
order in derivatives. In fact, the galileon theory in particular is constructed with this 
in mind. Nevertheless we include them in this section as both theories contain non- 
trivial derivative interaction terms. We will not discuss theories with infinite derivatives, 
as occur in string field theory, or p-adic string theory (see |904j for discussion of such 
theories) . 



4-.1. f(R) Theories 

Fourth-order theories of gravity have a long history, dating back to as early as 1918 
|1269j . only a few years after the first papers on General Relativity by Einstein. These 
theories generalise the Einstein-Hilbert action by adding additional scalar curvature in- 
variants to the action, or by making the action a more general function of the Ricci scalar 
then the simple linear one that leads to Einstein's equations. Here we consider the latter 
of these options, a choice that leads by Lovelock's theorem to fourth-order field equa- 
tions for anything except the addition of a constant term to the gravitational Lagrangian. 
Such theories, generically referred to as f(R) theories of gravity, have been intensively 
studied, and have a number of reviews dedicated to them [9691 111181 111671 13741 1978j . 
This interest was stimulated in the 1960s, 70s and 80s by the revelations that the quan- 
tisation of matter fields in an unquantised space-time can lead to such theories |1239j . 
that f(R) theories of gravity can have improved renormalisation properties [1 186] , and 
that they can lead to a period of accelerating expansion early in the Universe's history 
|1179j . More recently they have been of considerable interest as a possible explanation 
for the observed late-time accelerating expansion of the Universe. 
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4- 1.1. Action, field equations and transformations 

The f(R) generalisations of Einstein's equations are derived from a Lagrangian den- 
sity of the form 

C = V=gf(R), (286) 

where the factor of \f—g is included, as usual, in order to have the proper weight. This 
is clearly about as simple a generalisation of the Einstein-Hilbert density as one could 
possibly conceive of. The field equations derived from such an action are automatically 
generally covariant and Lorentz invariant for exactly the same reasons that Einstein's 
equations are. Unlike the Einstein-Hilbert term, however, the field equations that one 
obtains from the least action principle associated with Eq. (286) depend on the varia- 
tional principle that one adopts. Different possibilities are the 'metric variation' where 
the connection is assumed to be the Levi-Civita one, the 'Palatini approach' in which 
Eq. ( 286 1 is varied with respect to the metric and connection independently, and the 



'mctric-affinc' approach in which the same process occurs but the matter action is now 
taken to be a functional of the connection as well as the metric. In this section we will 
mostly be concerned with the metric variational approach, although we will also outline 
how the other procedures work below. 



Metric variational approach 



Let us now derive the field equations in the metric variational approach. Integrating 



Eq. ( 286 ) over a 4- volume, including a matter term and varying with respect to g^ v 
yields 

''\S(T^ + f R SR + |T^%„] , 



5S 



(287) 



where f R means the functional derivative of / with respect to R, \ is a constant, and 
is the energy- momentum tensor defined by a variation of the matter action with respect 
to g M „ in the usual way. Assuming the connection is the Levi-Civita one we can then 
write 



f R 6R ~ -\f R W» + f R ., P Ag^g pa - g» p g va )}5a, 



(288) 



where ~ is used here to mean equal up to surface terms 568 . Looking for a stationary 
point of the action, by setting the first variation to zero, then gives 



1 



Xr 



JrR^v - -zfg^v - Ir;iiv + g^Of R = -Tf 



(289) 



These are the f(R) field equations with which we will be primarily concerned in this 
section. It can be seen that for the special case f — R the LHS of Eq. ( 289 1 reduces to 



the Einstein tensor, and the field equations are second-order in derivatives of the metric. 



For all other cases, except an additional constant, the equations in (289) are fourth-order 
in derivatives. 



Conformal transformation in the metric variational approach 



As with scalar-tensor theories, the f(R) theories of gravity derived from the metric 
variational approach can be conformally transformed into a frame in which the field 
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equations become those of General Relativity, with a minimally coupled scalar field. This 
is sometimes referred to as 'Bicknell's theorem' in the case of f(R) theories, particularly 
when the minimally coupled scalar field is in a quadratic potential |155j . In the general 
case we consider conformal transformations of the form [1011 1853] 

9p, v = fng^, (290) 

together with the definition 

3 ln/fl, (291) 

X 

which allows the field equations from the metric variational principle, Eqs. ( 289[ ), to be 
transformed into 

1_ _ y / 1__ \X" 

R» v - -^g^vR = - \^,!M<P,u - ^g^g pa (t),p(l),a - g^V J + ^ T ^- (292) 

Here T^ v is a non-conserved energy-momentum tensor, and we have defined 



V = V(<t>) = ( RfR 2 ^ . (293) 

XI E 



Theories derived from an action of the form (286) can therefore always be conformally 
transformed into General Relativity with a massless scalar field (as long as fa ^ 0), and 
a non-metric coupling to the matter fields. 

Legendre transformation in the metric variational approach 

As well as conformal transformations, one can also perform Legendre transformations 
on f{R) theories in the metric variational approach. Such transformations allow the field 
equations of f{R) to take the form of a scalar-tensor theory (albeit it a slightly strange 
one). These transformed theories maintain the universal metric coupling of the matter 
fields, unlike the case of conformal transformations. 

The first step here is to notice that the Eq. (286) can be written in the equivalent 
form 

£ = V=g[f(x) + f(x)(R-x)], (294) 

where x is a new field, and the prime denotes differentiation. Variation with respect to 
X then gives 

f"(x)(R-x) = 0, (295) 



so that x = R f° r an fix) 7^ 0. Substitution of this result back into Eq. (294) then 
immediately recovers Eq. ( 286 ) , showing that the two Lagrangian densities are indeed 
equivalent. What is more, the special case f"{x) — can be seen to correspond to the 
Einstein-Hilbert action. 

Now, if we make the definition 

<t> = fix)> (296) 
and assume that 4>(x) is an invertible function, then we can define a potential 



A(0) = -[x(0)0-/( X (0))] 
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(297) 



In terms of this new scalar field we can then write the density ( 294 ) as 



C = yf=g[4>R-2k(<j>)], (298) 



which is clearly just a scalar-tensor theory, as specified in Eq. (97), with vanishing cou- 
pling constant, to = 0. As we have not transformed the metric, the coupling of this field 
to any matter fields that are present remains unchanged. 

The Palatini procedure 



Starting again from the density ( 286 ) we can also proceed in a entirely different way 
to the metric variational approach just described. Instead of assuming the connection 
from which the Ricci scalar is constructed is the Levi-Civita one, we could instead treat 
the metric and connection as independent fields. For the case of General Relativity a 
variation with respect to the connection then simply results in the connection being 
shown to be the Levi-Civita one, so that the difference between the metric variational 
approach and the Palatini approach is a semantic one. For the case of f(R) theories, 
however, the Palatini approach leads to an entirely different set of field equations. 



Starting with an integral of Eq. ( 286 1 over some 4- volume, and extremising with 
respect to g^ now gives 

- \g^f = f V, (299) 

where T^ v is once again the usual energy-momentum tensor. In this expression R^ v 
is now defined independently from the metric, and R should be taken to be given by 



g^R^. The next step is the variation of Eq. (2861 with respect to T fJ/ ua , which results 
in 

(V=5.9 A " y /fl) ;c7 = 0, (300) 

where the covariant derivative here should be understood to be taken with respect to 
r M j, ff , which is not the Levi-Civita connection unless fn ^constant (as is the case in 



GR). It is remarkable that the field equations (299) do not involve any derivatives of 



the metric, and only first derivatives of the connection. These are a different set of field 



equations to Eq. (289), and should be considered a different set of theories to the f(R) 



in which R is a priori taken to be constructed from the Levi-Civita connection. 



It can be noted from Eq. ( 300 ) that even if the connection is not compatible with the 
metric g^, we can still define a new metric, g^ v = fiig^v, with which it is compatible. 
Rewriting the full field equations under this conformal transformation we see that we 
recover General Relativity with a minimally coupled scalar field in a potential, but no 
kinetic term: 

C = ^[R-2V{4>)] , (301) 

where 4> = fa and V(4>) = (i?(0)</> — f(4>)) /2(f) 2 . Here R{(j)) and f(<p) are given by 
inverting the definition of <j), and R should be understood to be constructed from the 
metric connection compatible with g^ u . Transforming back to the original conformal 
frame this theory then can be shown to be equivalent to a scalar-tensor theory with 
u = -3/2 and A = (Rf R - f)/2 [533] . 

While being an interesting variant on the metric variational incarnation of the f{R) 
theories, there are a number of very severe problems in proceeding with the Palatini 
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procedure in this way. Not least of these is the apparent ill-posed nature of the Cauchy 
problem in the presence of most matter fields, which is discussed in [766 . Without a 
well-posed initial value problem f(R) gravity in the Palatini formalism lacks predictive 
power, and hence is not a good candidate for a viable theory of gravity. Furthermore, the 
Palatini approach to f(R) gravity also appears to introduce problematic strong couplings 
between gravity and matter fields at low energies [5091 1640j , and singularities in systems 
that are usually well described by weak fields [87, 88, 89 . For these reasons we will focus 
on f(R) theories in the metric variational approach in the sections that follow. 

For further details of the Palatini approach to f(R) gravity, and the results that 
follow from it, the reader is referred to |1167j . For studies of weak field gravity in the 
Palatini formalism the reader is referred to [92l 14291 I88T1 15551 1351 110001 [TTMl [TOTITI IT0781 
OH IMS ESI HP77] . and for cosmology to [12311 15051 15551 CM [5521 MM HH ITTM1 
17231 Hfl HS5J [75J 1T2521 17231 17511 17521 \WT\ . An interesting class of theories that 
interpolate between the Palatini approach to f(R) theories and the metric variational 
approach to f(R) theories is investigated in [351 I726[ . 



The metric- affine approach 



One further approach to the f(R) theories of gravity is the 'metric-affine' formula- 
tion. Here one again considers the metric and connection to be independent, as in the 
Palatini procedure, but now allows the matter action to be a function of both metric and 
connection (rather than metric alone, as is the case in Palatini and metric variational 
formalisms). The relevant action for the theory then becomes |1168j 

S= j dn^gf(R) + S m (g f ^ 1 T^ a ^), 



(302) 

where R — g^Ruv, and i?„„ is taken to be a function of the connection only, as in the 



Palatini procedure. One can therefore think of the action (302 1 as a generalisation of 



the Palatini action, which is recovered when the dependence of the matter action, S m , 
on T^ ua vanishes. 

As is the case in General Relativity, the invariance of the Ricci scalar under the 
projective transformation Y iL va V^ va + X v 8^ a can lead to inconsistency of the field 
equations, as matter fields do not generically exhibit an invariance of this type. This 
invariance can be cured by adding to the action an additional Lagrange multiplier term 
of the form S — J dfi^-jFA"^^ , and results in the field equations 



1 x 

JrR^iv ~^9iivj ~^Tp,v, 



(303) 



together with r^r , = 0, and 
1 



p,U\ 



op] 



A W A a \ v K^ 

P 3 a p 



where A n vp 



-(2/ ' sf zz g)8S m /5Yt l vp . n can then be shown that A, 



(304) 



corresponds 



to a vanishing of the torsion, and A M =^ introduces non-zero non-metricity [1168j . 
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The metric-affine approach has not been studied as intensively as the other approaches to 
f(R) gravity that we have already discussed, and will not feature heavily in the sections 
that follow. 



4-1-2. Weak- field limit 

The weak field limit of f(R) theories of gravity has been extensively studied in the 
literature. Here we will first consider perturbations about Minkowski space, and then 
perturbations around dc Sitter space, time dependent backgrounds, and inhomogeneous 
backgrounds. 



Perturbations about Minkowski space 



The first attempt at finding the Newtonian limit of an f(R) theory appears to have 
been performed by Pechlaner and Sexl for the case of / = R + aR 2 [1029] . The first step 
here is to write down the perturbed line-element as 



ds 2 



(l + 2^)dt 2 + {l-2<t)){dx 2 



neglecting time derivatives and second order terms in 
Ricci scalar, R = ~2Aip + 4A0, obeys 

6aSR — R = — — p, 
2 y 

and that the potential i\i obeys 

(1 + 6aA)Aip = y(l + 8aA)p, 



dy 2 +dz 2 ), (305) 
and ip one then finds that the 



(306) 



(307) 



where A is the Laplacian on Euclidean 3-space. The derivatives of p in this last equation 
occur due to replacing terms containing <f> with those obtained from taking derivatives 



of Eq. (306), and do not occur in the actual field equations themselves. 



Inserting a delta function source, p = mS, and integrating Eq. (307), using the 

one then gets the solutions [1029J 



solution to Eq. ( 306 1 to find 

167T0 



r 
\m 
r 



1 + 



1 - 



(308) 
(309) 



where boundary conditions at infinity have been imposed to eliminate exponentially 
increasing modes, and where we have defined the mass 



1 

6a 



(310) 



Mass terms similar to this continue to exist for more general theories, and more general 
situations, as we will outline shortly. One can see straight away that for large masses, with 



mo ^> r , the Yukawa potentials in Eqs. (308) and (309) are exponentially suppressed, 
and we recover ip = <f> oc —m/r. In the limit of small masses, however, when mo r^ 1 , 
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we instead find that -0 = 2<fi oc — m/r. For the case of / = R + aR 2 gravity we therefore 
already expect the PPN parameter 7 to be 1 when the mass of the scalar degree of 
freedom is large, and 1/2 when it is small. 

One is, of course, also interested in other functions of f(R), and can consider the case 
of analytic f(R) theories that can be expanded as 

00 

J(i?) = ^ Cl i? 1 , (311) 

i=l 

where the Cj are constants. To full post-Newtonian order the weak field solution in the 
presence of a perfect fluid is then given in full generality as |307j 



-(U + c 2 R) - \\J 2 + 2 C 4r 2 ^UR 

Cl Cf C\ 6C\ 187TC1 



-^V(Vp • VR) + -V(pIT) + -V{ P v 2 ) + -V{P) + X{UR) 

C\ Cl Cl Cl D7TC1 

X(R 2 ) - ^X(pU) + ^X(pR) 
8c 2 

2 X(Vp ■ VU) £%(Vp ■ VR) - —X(P) + ^X(pU) (312) 



-L (* - ^ ) X(R>) - *X(pU) + £ 
47r V ci 2c 2 / 3c 2 v 3c 2 

f^X(Vp • W) - 8 4x(Wp ■ VR) - ^X(P) + £ 
3cf cj ci 3ci 



2ci 2ci 6ci 6ci 6\/6cic 2 
9ij = (l + ^-(U- c 2 R)^j S tj (314) 

where U is the usual Newtonian potential, and the other potentials are defined as 

Vt = j ^^ d 3 x' Wi = j flx,)(v( ^ x ;p ))( " , ' , fe' 

x(Q)^j Q,e ~iS>r' l d3x/ * = / p,v 'x-^~ x ' )i e ~^~^ d3x ' 

X = fp>e-^ l *-*' l d 3 X > R=irJ 0^e~^-^d?x', 

where R is the Ricci scalar, P is pressure, p is the rest-mass energy density, and II is 
the specific energy density (as defined in |1274j ). The terms in Eqs. (313 )-( 314 1 that are 
functionals of derivatives of p, U and R can be recast into a form where such derivatives 
do not appear by further manipulations [307J . 

From the above it can be seen that the results of General Relativity are recovered, 
to the appropriate order, when c 2 — >• c 3 —¥ 0. For non-vanishing values of the these 
constants a large number of extra Yukawa potentials are present, and for large enough 
values of c 2 we can again see that 7 —> 1/2, as the scalar degree of freedom becomes 
massless. 
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The study of gravitational waves about a Minkowski space background in f(R) theo- 
ries has been undertaken by Berry and Gair in |146j . Here the authors consider analytic 
functions of the type prescribed by Eq. (311), and find that an extra mode of oscil- 
lation is possible. The gravitational waves emitted by extreme-mass-ratio inspirals are 
then calculated, and the authors conclude that current laboratory bounds (that result in 
|ca/ci| < 10~ 9 m 2 ) mean that the extra oscillatory mode they find cannot be excited by 
astrophysical sources. 



Perturbations about de Sitter backgrounds 



As well as the usual expansions about Minkowski space, in order to determine the 
post-Newtonian behaviour of a theory, one is also interested in perturbations about other 
backgrounds. Here we will consider de Sitter space as a background. This is not meant to 
be an elucidation of cosmological perturbation theory, but rather a consideration of weak 
field expansions as applicable to systems such as the solar system and binary pulsars. 
Minkowski space is not always a stable solution of f(R) theories of gravity that attempt 
to produce self- accelerating cosmologies at late-times, and in these cases time-dependent 
backgrounds, and de Sitter space in particular, become of increasing interest for weak 
field studies. 

Much work has been performed on establishing the weak field limit of f(R) theories 
about a de Sitter background, as relevant for theories that try and account for late- 
time accelerating expansion without dark energy, see e.g. |1050| 14211 I117R 19431 1998| 
111291 113001 1930j . The majority of these studies conclude that, in the absence of extra 
mechanisms to mask such behaviour, one should expect to find 7 = 1/2. This was shown 
in an early paper on the subject in |292j . and is the familiar limit of theories in which 
a scalar degree of freedom has low mass. Here we will briefly sketch out the method by 
which such a result is found for more general f(R), following the approach of [294]. 

The first step here is to identify a de Sitter solution with constant Ricci curvature 
R = Rq = 12Hq. The line-element is then perturbed as 

ds 2 = -(1 + 2ip - H^r 2 )dt 2 + (1 - 20 + Hlr 2 )dr 2 + r 2 dn 2 , (315) 

where we have chosen to present the de Sitter background using a static coordinate patch. 
We then proceed by perturbing the Ricci scalar as 

R = R + R 1 , (316) 

where i?i <C Rq- The perturbative expansion then linearises all field equations with 
respect to <fi, ip, R\ and H^r 2 , and their derivatives, while neglecting all time derivatives. 
The case of spherical symmetry is considered for simplicity. 
To lowest order the trace of the field equations is now 

ARi _ f f R - f RR Ro \ Ri = _ w (317) 

\ oJrb, / VJRR 

where fn and fiw should be understood to be the value of these quantities at R = Rq 
(implicit here is an assumption that these quantities are all of the same order of magnitude 
as Rq, and that R in the weak-field systems under consideration takes the same value as 
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in the cosmological background solution). From cosmological considerations the second 
term on the left-hand side of Eq. ( |317[ ) is then neglected, as the factor in brackets 
corresponds to the mass squared of the scalar degree of freedom, which must be small 
compared to in order to have late-time accelerating expansion. The resulting form 
of i?i is therefore found to be 

* K sSb? < 318 > 

where m is the mass of the object at the centre of symmetry. Applying the same approx- 
imations to the (t, t) component of the field equations results in 

*JR * JR 

which on substitution of the expression for Ri gives to lowest order 

The remaining field equations then give 4>, to the same order of approximation, as 

Xm tp 



2Airf R r 2 



(321) 



This calculation has not been performed in the PPN gauge, which uses an isotropic 
spatial coordinate system, but nevertheless one can verify that when interpreted within 
the standard PPN framework it does indeed give |1001j 

7 = ~. (322) 

In this section we have discussed de Sitter space as a background to perturb around. 
However, establishing whether de Sitter space is, in fact, a stable asymptotic solution of 
f{R) theories, and establishing the genericity of initial conditions that lead to de Sitter 
space at late times, has not yet been discussed. We will consider this subject in following 
sections. 

Perturbations about other backgrounds 

Having considered the maximally symmetric Minkowski space and de Sitter space 
backgrounds, we can now also consider less symmetric spaces to perturb around. This 
enterprise is hindered by our ability to find less symmetric solutions to the field equations 



(289). We can, however, make progress with some simple cases. 

If we consider f(R) theories in which / = R 1+ then one can find exact non-static, 
homogeneous and isotropic vacuum solutions [241J. Such solutions can be shown to be, 
under certain conditions, stable asymptotic attractors for the general class of spatially 
flat, vacuum FLRW solutions |248j . They will be discussed further in the cosmology sec- 
tion that follows. For this same class of f(R) theories exact static, spherically symmetric 
vacuum solutions are also known |310j . which can also be seen to be generic asymp- 
totes of the general solution, with the specified symmetries applied |310| . We are now in 
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possession of two exact solutions, for the same theories, which have less than maximal 
symmetry, and which can be used as backgrounds to perturb around. 

Spherically symmetric, time independent perturbations around the homogeneous, 
time-dependent background found in [241] are given to linear order by 



ds 2 = -(1 + 2i\i)dt 2 + a 2 (t){l - 2cj))(dr 2 + r 2 dft 2 ), 

5(1+25) 

where a(t) = t ^-^ . The perturbations <j){r) and ip(r) are then found to be 

ci 2c 2 (l - 66 + 4S 2 +4<5 3 ) 2 
^ ~ ~7 + (5 - US - US 2 ) r 
(1 - 25) Cl a 

= 0C2T . 

r 

The corresponding perturbative analysis about the static, spherically symmetric back- 
ground found in [310] gives 

ds 2 = -r 2S T^T(l + V(r))dt 2 

(1 - 26 + 46 2 )(1 - 25 - 28 2 ) . „„ . . , 2 2 Jo2 
-(1 + W(r))dr 2 +r 2 dil 2 , 



(1-S) 2 

where V(r) and W(r) are given in full generality by V(r) = c 3 Vi(r) + c^V^r) + c$Vz(r) 
and W(r) — —c 3 Vi(r) + c i W2(r) + c 5 W 3 (r), where 

(1-25+45 2 ) 

Vi = -r <t^^ (323) 

and where V2, V3, W2 and W3 are oscillatory modes |310j . It can immediately be seen 
that the form of the linearised perturbations around these two backgrounds are quite 
different to each other, even though the field equations they obey are identical. One can 
verify that an observer in the homogeneous, time-dependent background should measure 

(3 = 1 and 7 = 1 - 26, 

which gives 6 = —1.1 ± 1.2 x 10~ 5 when the constraint derived from the Cassini space 
probe on 7 is applied |147j . On the other hand an observer in the static background 
should measure an anomalous extra gravitational force that goes like |310j 

r 

When subjected to constraints from observations of the perihelion precession of Mercury, 
the presence of this extra force gives 6 = 2.7±4.5x 10 -19 |310j . The different forms of the 
gravitational potentials and forces in these examples show that the choice of symmetries 
of the background space-time can have important consequences for its weak field limit, 
and the constraints on the underlying theory that are derived from it. 

Chameleon mechanism 
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As with a variety of other modified theories of gravity, the 'Chameleon Mechanism' 



has been applied to f(R) theories. This mechanism was outlined in Section 3.1.4 where 
a summary of some of the accumulated literature on it was outlined. Here we will simply 
reiterate the basic point that this mechanism potentially allows a means by which theories 
with a light effective scalar degree of freedom can evade solar system and binary pulsar 
tests of the PPN parameter 7 by allowing the scalar to acquire a higher mass in the 
locale of high mass concentrations, such as the Sun and Earth. 

The chameleon mechanism has been applied to specific f(R) theories, and its be- 
haviour in this application considered further, in e.g. [H71 15351 111501 13551 H531 ITT551 



244, 629 . As with other applications of the Chameleon mechanism, if a 'thin shell' is 
present then the mass of the scalar degree of freedom in these theories is thought to be 
able to be supressed enough to satisfy solar system tests of gravity. 

4-1-3. Exact solutions, and general behaviour 

Having discussed the weak field solutions which arc of interest for inferring constraints 
on f{R) theories from observations of gravitational phenomena in the solar system and 
binary pulsar systems, let us now consider the behaviour of solutions to the full non- 
linear field equations. Here we will be concerned with exact solutions, which can be 
obtained in some simple cases, as well as what can be inferred about the general be- 
haviour of non-linear solutions by other methods, and what theorems can tell us about 
the behaviours that are possible. The relatively simple structure of f(R) theories make 
such considerations a feasible proposition. The geodesic deviation equation in general 
f(R) theories has been studied in 



Isolated masses, and black holes 



Progress was made into understanding the static spherically symmetric vacuum so- 
lutions of f(R) theories of gravity by Mignemi and Wiltshire in |891j . These authors 
consider theories with higher powers of the Ricci scalar added to the Einstein-Hilbert 
Lagrangian, and use a dynamical systems analysis to determine the behaviour of the gen- 
eral solutions with the specified symmetries. They find the asymptotes of these solutions, 
for a variety of different cases, and show that the only static spherically symmetric solu- 
tions of the theories they consider that have regular horizons are the Schwarzschild solu- 
tions. They further find that by dropping the requirement of regularity the Schwarzschild 
solution is also the only solution to these theories that is asymptotically flat. 

The black hole 'no-hair' theorems have been considered in the context of f(R) = 
R + aR 2 theories by Whitt |1270j . Collapse to a black hole, however, has not been as 
extensively studied in f(R) theories of gravity as it has in Brans-Dicke theory, where 
direct numerical calculations have been performed |1107j . Nevertheless, the same logic 
that tells us that the vacuum black hole solutions of general relativity arc the only vacuum 
black hole solution of Brans-Dicke theory that can result from gravitational collapse, also 
suggest that this should be true for f(R) gravity. In particular, most of the results of 
Hawking on this subject only rely on inequalities of the form 

R^Fl" > 0, (324) 

where I 11 is a null vector, and not on the details of the field equations themselves [555] . 
This null energy condition is true of the conformally transformed scalar fields equations 
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in Brans-Dicke theory, and is also true in f(R) gravity. It therefore seems reasonable 
to expect that the vacuum black hole solutions of f(R) gravity should also be the same 
as the vacuum black hole solutions of General Relativity. The subject of black hole 
radiation in the context of f(R) gravity has been studied in [2001 13221 [TBI 15431 1378j and 
|489) where it was shown that black holes in f(R) gravity have a entropy given by 



S = 



f R A 



(325) 



The subject of the de Sitter no-hair theorems and isotropisation in f(R) gravity has been 



considered by Barrow and Ottewill 110 and Goheer, Leach and Dunsby [540], where 
it was shown that flat FLRW isotropic points can exist in the phase plane of Bianchi 
solutions, and that de Sitter space can be a stable asymptote of f(R) theories of gravity. 
One should, however, be aware that such behaviour depends on the theory in question, 
and the initial conditions. For example, for theories with negative powers of R in a 
series expansion of their Lagrangian one may generically expect such terms to become 
important asymptotically. In this case accelerating power-law expansion is an attractor 
solution 306 . This will be discussed further in the cosmology section below. 

As well as the black hole solutions of General Relativity, it is known that other 
vacuum solutions to f(R) theories of gravity that can describe isolated masses also exist. 
Due to the complicated nature of the field equations in these theories, however, only 
a few exact solutions that describe these situations have been found. For the case of 
f(R) = R 1+s solutions are known that correspond to an isolated mass in a homogeneous 
and time dependent background, and an isolated mass in a static, spherically symmetric 
background. The former of these solutions is given by the line-element [304J 



ds 2 



-A^dt 2 + a 2 (i)Si(r)(dr 2 + r 2 dn 2 ), 



(326) 



where a(t) — t {1 ~ 5) , and where A\{r) and Bi(r) are given by 



Mr) 



1 - 



Ci \ i 



and Lhln ( 1 + — ) Air) 



9+25-1 



where q 2 = 1 — 26 + 4<5 2 . The latter solution is given by [310] 



where 



A 2 (r) = 
B 2 (r) = 



ds 2 = -A 2 (r)dt 2 + 



25 ( , 1 + 2 /' , C 2 



dr 2 



r 2 dfl 2 



(1-4.5) ' 

r 

(1 - 26 + 46 2 )(1 - 25{1 + 6j) 



1 + 



(1-25+4.5 -*) 

r t 1 - 5 ) 



(327) 



The constants C\ and C2 appear in these solutions as mass parameters, and it can be 
seen that both Eq. (3261 and Eq. (327) reduce to the Schwarzschild solution when 
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S — >■ 0. The problem of static, spherically symmetric solutions in general f(R) has been 
considered in |992| , where a covariant formalism was developed for studying the problem, 
and the non-uniqueness of the Schwarzschild solution was demonstrated. The 6 = 1/4 



case of Eq. ( 327 ) was rediscovered in [243J . Black holes coupled to Yang-Mills fields have 



been studied in [872], where an exact solution was found for the case / = yR. 

These solutions are interesting for a number of reasons. Firstly, they show that the 
generalisation of the solutions of General Relativity to other theories of gravity is not 
always unique; i.e. there can be multiple solutions in modified theories of gravity that 
reduce to the same solution in the limit of general relativity. It may therefore be the 
case that one needs to understand the symmetries of the background space-time to a 
greater extent than is necessary in General Relativity, in order to fully understand which 



solution should be used to model a given situation. Secondly, Eq. ( 326 1 shows explicitly 
that Birkhoff 's theorem is not valid in general, when one considers generalisations of Ein- 
stein's theory. Spherically symmetric vacuum solutions of modified theories of gravity can 
therefore be time-dependent, which can lead to new phenomenology. BirkhofFs theorem, 



in the context of f(R) gravity, has been considered in [49(1. Thirdly, Eq. (327) displays 
non-trivial asymptotic behaviour as r — ¥ oo. Such behaviour is unexpected in general 
relativity, and again opens the window to new phenomenology. The results of Mignemi 
and Wiltshire [891] even suggest that such behaviour is generic. A fourth point is that 



the solution given in Eq. ( 326 1 has been shown in |488j to exhibit a naked singularity. 
This has clear implications for the applicability of the cosmic censorship hypothesis to 
modified theories of gravity. The Misner-Sharp energy in spherically symmetric space- 
times is considered in |225j . 

Cosmological solutions 

A variety of cosmological solutions in f(R) theories of gravity are known, and have 
had their stability analysed. Here we will briefly review and provide references to studies 
of these solutions. 

The conditions for existence and stability of de Sitter solutions in f(R) gravity appears 
to have been first studied in |110) . One can show that for any theory for which there is 
a value of R which satisfies 

f R (R d s)Rds = 2f(Rds) (328) 

there exists a de Sitter solution with Rds = 4A. The stability of de Sitter solutions in 
f(R) gravity was studied in [110, 1049, 486 . These solutions are of obvious importance 



for cosmology at both early and late times. One can note that with f(R) oc R 2 Eq. (328) 
is satisfied with any value of R. Theories with R 2 terms in their Lagrangian's have been 
studied extensively, due to the naturalness of adding an R 2 term to the Einstein-Hilbert 
action, and due to their improved renormalisation properties [1186] , They were also 
introduced and studied by Starobinsky for cosmological purposes, and in particular their 
ability to give rise to an early non-singular period of accelerating expansion in a natural 
way [TT79] . 

Less symmetric cosmological solutions than de Sitter space can also be found for some 
f(R) theories. In particular, theories of the type f(R) = R 1+s are again of interest here, 
as they admit simple exact solutions. As mentioned in the preceding section, a power-law 



113 



exact solution for a spatially flat vacuum FLRW solution is known to be given by |241j 



a{t)=t s1 ^-, (329) 

and a spatially flat solution in the presence of a perfect barotropic fluid with equation of 
state P — wp is also known |248j 

2(1 + 5) 

a(t)=tV+^. (330) 

The stability of these solutions, and their properties as asymptotes of the general solution, 
have been investigated in |248j and |310j . In fact, it has been shown that these are the 
only power-law perfect fluid FLRW solutions that exist for any f(R) gravity theory 
|539) . Explicit non-power-law general solutions with FLRW symmetries were found in 
|305) . both with spatial curvature, and in the presence of a perfect fluid. These solutions 
show explicitly that in the early universe both non-singular and inflationary behaviour 
are possible. The energy conditions in FLRW solutions have been considered in |1033j . 
and braneworld cosmology in these theories have been considered in |68j . 

Beyond exact solutions, FLRW cosmological solutions have also been studied in f(R) 
theories of gravity using dynamical systems analysis. This has been done for the case of a 
number of particular f(R) theories in 2481 13101 17901 151 153"5] and also in the general case in 
|390| 13061 l40l I253j . The dynamical systems approach has even been applied to perturbed 
FLRW solutions in |247j . We will discuss perturbed FLRW further in the Cosmology 
section that follows. These studies find a variety of interesting cosmological behaviours 
at both early and late-times. In particular non-singular and accelerating behaviour in 
the early universe is again identified, as well as late-time accelerating expansion, and the 
non-sequential domination of higher powers in the Ricci curvature, for analytic f(R), as 
the initial singularity is approached. The conditions required for a non-singular 'bounce' 
are given in [249] . and oscillating solutions were considered in |208j . There have also 
been some concerns expressed as to whether a matter dominated epoch is generically 
expected to exist after radiation domination [242, 972, 40, 431 1479j . The inverse problem 
of finding particular forms of f(R) that result in pre-specified cosmological evolutions 
has been considered in [2401 E2S1 EEl [3761 EZ2 EOH EEO E8QJ- Such inversions do 
not always specify f(R) uniquely 929 , and it has been shown that to reproduce exact 
ACDM evolution with dust only one is forced towards the Einstein-Hilbert action with 
a cosmological constant |443j . 

Exact Bianchi cosmological solutions were discovered for f(R) = R n theories in [9"5] . 
and been studied further in |778[ I540j . where shear dynamics and isotropisation are 
discussed. The special case of n — 2 was studied in |209) . Bianchi type I and V 
solutions have been considered in |1132j and |1133) , and Bianchi VII a solutions in |367j . 
Kantowski-Sachs solutions have been studied in [786] . Other know exact solutions are 
the Einstein static universe [HI UM \EM ITT23] , and the Godel universe [HI [1057] . 
These studies explore the stability of the Einstein static universe, and the existence of 
closed time-like curves in the Godel solution. 

4- 1-4- Cosmology 

Much of the recent motivation for studying f(R) gravity has come from the need to 
explain the apparent late-time accelerating expansion of the Universe. Previous motiva- 
tion for studying f(R) gravity has also come from cosmological considerations, including 
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the presence of an initial singularity, and early universe inflationary expansion. We will 
therefore now present an overview of what we consider to be some of the most relevant 
aspects of f(R) gravity for physical cosmology. In terms of the viability of FLRW ge- 
ometry in f(R) gravity, the Ehlers-Geren-Sachs theorem of General Relativity has been 
extended to cover these theories by Rippl, van Elst, Tavakol, and Taylor in [1065], and 
more recently by Faraoni in [48 7] . 

Field Equations 



To describe the cosmology, up to scalar perturbations, we first define line-element 

ds 2 = -(1 + 2^)dt 2 + a 2 (t)(l - 2$)q ij dx i dx i , (331) 

and the energy-momentum tensor, 

T° = -P-Sp (332) 

(333) 



T°i = -{p + P)V t 9 



5PS% + {p + P)D l 1 E, 



(334) 



where 9 is the peculiar velocity, SP is the pressure perturbation, and £ is the anisotropic 
stress. At zeroth order the Friedmann equations are 



H 2 



1 

3F 



1 



8ttp - -(/ - RF) — 3HF 



K 

~n2 



1 



H = - — {87rp + 8nP + F-HF) 
2r 



K 

~7a 



(335) 
(336) 



where the Ricci scalar is given by R — 6(2H 2 + H + n/a 2 ), and energy-momentum 
conservation gives, as usual, 

p + 3H(p + P) = 0, (337) 

where F = fa, over-dots denote derivatives with respect to t, and k is spatial curvature. 
Now let us consider the first-order scalar perturbation equations, which are given in 
Here it is convenient to define a new quantity 

(338) 



X = + 3$. 
The perturbation equations are then |637j 



X + 
1 

2F 



2H 



F 
2F 



X - 



3F- 

* 

2F 



3H 



— (2F + HF)-X 
2F y ' a 2 



inSp + 2Att6P + 36F + 3HSF 



6k 



- 6H 2 SF 



(339) 



and 



SF + 3HSF 



r,2 



SF 



8?r 



(Sp - 35P) + F(x + *) + (2F + 3HF) * - ^SR 
115 V 



(340) 



with fluid evolution equations 



Sp + 3H(Sp + SP) = (p + P)[x- 3F* 



kH_ 

a 



and 



(a A {p + P)key 
a 4 (p + P) 



1 



(P + P) 



SP--(k 2 



3k)(p + P)E 



Here the perturbation to the Ricci scalar, SR, is given by 



SR 



X + 4H X 



k 2 



(fc 2 - in) 



and we have the constraint equations 



3 -F 3 

\Jf = 

2F 2F 



8ira{p + P)9 + SF-HSF 



(341) 



(342) 



(343) 



(344) 



and 




Furthermore, we again have that ^ $, in general. Instead it is the case that 

87ra 2 (p + F)S 



F 



F 



(345) 



(346) 



The equivalent equations to those given above can also be derived in the covariant ap- 
proach to cosmological perturbation theory |250| . In the rest of this section we will 
consider the consequences of these equations for various cosmological phenomena. 

Inflation 

The existence of inflation has provided considerable motivation for the study of f(R) 
theories of gravity. The pioneering work on this subject was that of Starobinsky in 1980, 
who found that theories with R 2 corrections to their gravitational Lagrangian can have an 
early period of de Sitter expansion [1179] . The spectrum of scalar and tensor fluctuation 
generated during this type of inflation have been studied in [9161 111781 17191 1636) where 
they were found to compatible with observations of the CMB. Quantum initial conditions 
("tunnelling from nothing"), as well as the process of reheating, were also considered in 
PI] , 

Inflation in f(R) gravity is particularly transparent in the Einstein conformal frame. 
Here, for the Starobinsky model with |1179j 



f(R) = R + 
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R 

6M 2 ' 



(347) 



the conformally transformed theory in vacuum is one in which the minimally coupled 
scalar field exists in a potential 

V(4>) = ^ (l - e-vT*) . (348) 

This potential is displayed in Figure [4j where it can be seen that slow-roll inflation is 
likely to occur in the region ip > m p i , and reheating is feasible during oscillations around 
the minimum at <f> = 0. This is, of course, exactly the type of behaviour that one wants 
for a viable inflaton field. 
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Figure 4: The potential given in Eq. J348J , normalised by its asymptotic value as i 



In fact, for the theory specified in Eq. (347) one can show that inflation is the 
transient attractor of the general solution 852 , and that in the region (f> 3> m p i slow-roll 
inflation occurs with 

H M 2 , s 

£ = -i^6^' (349) 
and proceeds for N ~ (2e) _1 e-foldings. We will not proceed with showing the details 
of reheating in this model, but only note that around 4> ~ the potential given in Eq. 



(348) is well approximated by V ~ ^AI 2 (f> 2 . For details of how reheating occurs in this 
potential the reader is referred to [12491 [892 . Pre-heating in f(R) inflationary models 
has been considered in [1222J. Quantum cosmology, instantons, and their implications 
for inflation have been studied in |1246| 11027] . 

Dark Energy 

As well as accelerating expansion in the early universe, f(R) theories of gravity are 
also capable of producing late-time accelerating expansion. There have been a large num- 
ber of papers on this subject. There have also been attempts to construct quintessence- 
like f(R) models which produce both early and late acceleration [9741 19761 WT71 19751 152] . 

An easy way to see the potential for late-time accelerating expansion is to consider the 



Friedmann equations (335) and (336) in vacuo. One can then identify effective density 
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and pressure parameters by analogy to the Friedmann equations of general relativity. 
These are 



871-peft = 

8i:P cff = 



RF - f - 6HF 
2F 

2F + AHF + / - RF 
2F 



The equation of state of this effective fluid is then given by 

2F + AHF + / - RF 

W = ; , 

RF- f - 6HF 



(350) 
(351) 

(352) 



and one can then determine what is required to achieve w < —1/3, and hence accelerating 
expansion. 

One example of this is the now much considered theory of Carroll, Duvuri, Trodden 
and Turner |255j 



f(R) = R — 



A 4 



2(n+l) 



R" 



(353) 



where \x is a constant. For power-law evolution the effective equation of state, (352), is 
then given at late-times by |255) 



w 



-1 



2(n + 2) 



3(2n+l)(n+l)' 



(354) 



so that if n — 1, and the extra term in the gravitational Lagrangian is inversely propor- 
tional to R, then one achieves an equation of state with w — —2/3, and hence accelerating 
expansion. In fact, this just corresponds to the power-law solution given in Eq. (329), 
for a theory with f(R) <x R~ n . 



The theory specified in Eq. ( 353 ) is now known to have a number of deficiencies that 
make it non- viable [9991 S3 14791 14281 14851 12581 14731 [TlMl H22l [TTBT] . Some of these 
have to do with the weak-field limit, which we have already discussed, others come from 
cosmology, and yet more are due to stability issues, which we will address in Section 
|4.1.5| Many of these problems can be traced back to the value of the effective mass in 
the scalar degree of freedom of this theory, which is thought to be either too small for 
validity of gravitational physics in the solar system, or imaginary, leading to some of the 
instabilities just alluded to. Models which have been constructed to try and over-come 
these problems, while still leading to accelerating expansion at late-times, are those of 
Starobinsky [HSU] : 



f(R) = R-{iR c 



Hu and Sawicki |629j : 



f(R) = R- 



yRc 
l + {R/Rc 



and Battye and Appleby 



f(R) =R + R C log + (1 - e'^e 
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-ft/ Ac 



(356) 



(357) 



where fx, n and R c are all positive constants. Attempts to construct viable models that 
include an early stage of inflationary expansion, as well as late-time accelerating expan- 
sion, have been made in 321 , 468 . All of these theories rely on the chameleon mechanism 
to satisfy solar system constraints on gravity. 



Observational Probes 



As with many modifications to gravity, cosmological observables can be used to con- 
strain f(R) theories of gravity. Here we will briefly survey the literature on this subject. 

Primordial nucleosynthesis has been studied in f(R) gravity in [3101 12041 17641 [475] . 
Due to the conformal equivalence between these theories and general relativity, the be- 
haviour of cosmological solutions during the radiation dominated epoch are considerably 
simplified: They evolve in a similar way to the radiation dominated solutions of GR, but 
with a different value of the effective gravitational constant. This situation is familiar 
from studies of primordial nucleosynthesis in the scalar-tensor theories of gravity out- 
lined in Section [3~T| In the present case the relevant effective gravitational 'constant' is 
inversely proportional to f R . The value of f R evolves throughout the matter dominated 
and accelerating epochs, but is constant during radiation domination. Observations of 
the abundances of light elements then provide constraints on the allowed values of f R 
during the radiation dominated epoch, and hence constrain the rate of evolution of this 
quantity that is allowed during the rest of the Universe's history. 

Other probes of the background expansion of an FLRW universe are the peak posi- 
tions of the CMB spectrum of temperature fluctuations, and baryon acoustic oscillations. 
Observations of these quantities allow the form of a(t) to be constrained, but due to the 
freedom in the choice of f(R) are not able to falsify the most general form of these the- 
ories directly [MB \ZM EZ3 EZ2 EMI E5D EHB1- To go further using cosmological 
observations we must therefore consider the solutions to the perturbation equations given 
above. 

The first thing that one may wish to consider is the growth of density perturbations, 



S = Sp/p. In a spatially flat universe, manipulation of Eqs. ( 339 )-( 346 1 allows one to 
write [™rT2T51IT2^6] 

s + 2HS ~JK (i + (a/k)n>P) =0 ' (358) 

where the mass parameter M is given, just as in the weak field limit discussed in Section 
|mi as 

M2= f R -Rf RR (359) 

3jRR 



From the third term on the LHS of Eq. (358) it can be seen that the evolution of 



S depends on the magnitude of M, and, in particular, is different in the two regimes 
M 2 3> k 2 /a 2 and M 2 <C k 2 /a 2 . When the former is true, the density perturbations 
evolve as they do in General Relativity, with an effective Newton's constant given by 
G = 1/ f R . For a matter dominated universe this means 

5octi (360) 
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In the latter regime, in which M 2 <C k 2 /a 2 , this is no longer true. Here, the third term 
in Eq. (358 1 is modified from its form in GR by a multiplicative factor of 4/3, and the 
evolution of 6 during the matter dominated era is consequently modified to 



Joc ^-i)/6_ (361) 

The transition between these two limits is theory dependent. For studies on this subject 
the reader is referred to [TTgOl 15251 IT221 13771 [12151 [12261 IT0T51 IT2201 15151 [T22T1 15T41 [5JD] • 

Interesting results are that the change in evolution between the two regimes discussed 
above is scale dependent. That is, modes with different wave- numbers can evolve in 
different ways depending on whether they are larger or smaller than a M . This length 
scale is therefore imprinted on the density perturbations. Furthermore, oscillating modes 
can also become present when M 2 ^> k 2 /a 2 , which can lead to undesirable singularities 
(515] . The inclusion of an R 2 in the gravitational Lagrangian was found to remove these 
singularities in |57j . One can also see that fan > is need for the stability of scalar 
modes. 

The modified growth of structure just discussed has consequences for large-scale struc- 
ture, and the cosmic microwave background, which we will now discuss. The matter 
power spectrum in f(R) theories of gravity has been considered in |4951 12581 111021 [T221 
[7241 [TTBTl 17901 1461 1451 18T411T026] . and cluster abundances have been used to constrain 
these theories in [11151 1818, 500J. The formation of non-linear structure has also been 
considered in [6301 110061 110071 111141 11303j . Cosmic microwave observations are consid- 
ered in |7901 111581 111571 1818) where it is shown that power on large-scales is sensitive 
to the modified growth of structure through the integrated Sachs- Wolfe (ISW) effect. 
This can lead to damped power for small deviations from GR, or amplified power if the 
deviations are large enough. Correlating ISW effects in the CMB with observations of 
galaxy number density also leads to tight constraints [11581 1818] , due to the sign of the 
CMB temperature fluctuation changing if the modification to gravity is large enough. 

4-1.5. Stability issues 

There are a variety of stability issues that are of concern for f(R) theories of gravity. 
These include ghost degrees of freedom, as evidenced in the Ostrogradski instability, as 
well as the instabilities found by Frolov, and Dolgov and Kawasaki. Some of these issues 
have been mentioned already. In this section we will discuss them further. 

Ghosts, and the Ostrogradski instability 

In Section |2.1.3[ we discussed the problems associated with ghosts - pathological 
fields that admit physical states of negative energy, or negative norm when quantised. It 
is known that ghosts can occur in general higher derivative theories of gravity, see e.g. 
[TTBTl 12351 15051 15071 12531 1511 13521 ITT31 [550] . They are not, however, as problematic in 
f(R) theories as they are for general fourth-order theories, as we will now outline. Let 
us first consider the existence of negative energy states in the context of Ostrogradski's 
theorem [T00g] . 

The Ostrogradski instability states that Lagrangians that contain second derivatives, 
and are non-linear in those second derivatives, are generically unstable. At first sight 
such a result appears to be problematic for f(R) theories of gravity, which are only linear 
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in second-order derivatives of the metric in the case of General Relativity with a possible 
cosmological constant. One can show, however, that these instabilities do not occur for 
f(R) theories |1281j . This works in the following way: Let us consider a Lagrangian 



L = L(g,g,g), (362) 

where dots denote derivatives of g with respect to some parameter A. Now define a set 
of four canonical variables by Qi = g, Q 2 = g, and 

p dL d dL dL 

Pl ^dg-TXdg> ^ P ^W (363) 

If it is now possible to write g = f(Qi, Q2, P2) then the Hamiltonian of the system can 
be written as 

H = P 1 Q 1 +P 2 f-L(Q 1 ,Q 2 ,f). (364) 

This Hamiltonian, however, is only linear in the momentum Pi, and cannot therefore 
be stable. This is Ostrogradski's instability. Now, f(R) gravity avoids this instability 
because one cannot write down the equivalent of g = /(Qi, Q2, P2) for each component 
of the metric. Instead, only a single scalar degree of freedom contains the higher-order 
derivatives, and by an appropriate field redefinition one can remove this extra field so 
that the redefined metric appears in the Lagrangian only linearly in its second-order 



derivatives. This is just the conformal transformation discussed in Section 4.1.1 The 
Ostrogradski instability does not, therefore, apply to f(R) theories of gravity |1281j . 

Let us now consider ghost-like instabilities from the point of view of linear fluctua- 
tions. In generic fourth-order theories massive spin-2 degrees of freedom appear along 
with a scalar degree of freedom, and the familiar massless spin-2 degree of freedom from 
General Relativity. It is the massive spin-2 fields in this situations that present the 
generic problem with ghosts. Such fields are absent from f(R) theories, however, which 
contain only the massless spin-2 fields of GR, and a single scalar field. Again, this is clear 



from the existence of the conformal transformations outlined in Section 4.1.1 The f(R) 
theories of gravity therefore do not always suffer from the same problems with ghosts 
as more general higher-order theories, which will be discussed in more detail in Section 

MM 



Frolov instability 

A potential problem with f(R) theories that modify the infra-red limit of General 
Relativity has been identified by Frolov in |515j . This instability is caused by the fact 
that for the scalar degree of freedom in f(R) theories curvature singularities can occur 
at finite field value and energy level, a phenomenon previously investigated in [¥, 203J. 



To illustrate this problem consider the f(R) proposed by Starobinsky, Eq. (355). 
The potential for the effective scalar field in this theory is shown in Fig. [5j During 
cosmological expansion, the scalar fields associated with FLRW cosmologies roll down 
the slope from <f> = to the local minimum at <p ~ —0.1. The short section of curve 
between this minimum and the singularity at <f> — is the only part of the potential 
the scalar field need experience in the entire history of a perfect FLRW solution. Frolov 
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Figure 5: The potential for the scalar field in Starobinsky's theory, Eq. | |355| , with R c = 1, n = 1 and 
\i = 2. 



argues, however, that relatively small perturbation in curvature, caused by collapse of 
dust, are enough to push <j> back up the potential to the singularity. The existence of 
such instability is, of course, undesirable for a physically plausible theory, although it 
may be mitigated by the addition of higher power of R to the gravitational Lagrangian 

1231 EH EH . 



Dolgov-Kawasaki instability 



Finally, let us consider another instability that was initially found by Dolgov and 
Kawasaki for the theory given by Eq. (3531 with n = 1 [428]. This was later extended 



to more general functions of f{R) that modify gravity in the infra-red limit |485j and 
formalised better in [1124] . 



The basic point here is that the trace of the field equations, (289), acts as the prop 



agation equation for scalar degree of freedom. For Eq. (3531 with n = 1 this equation 
is 

R \ R 2 



R- 



= 87rp, 



(365) 



where we have taken the matter content to be that of dust. Now, de Sitter space is a 
solution of this equation, with 



R ds = l -{^p + V(^P) 2 + IV) - 8tt P , 



(366) 



for cosmologically relevant \i. If we now consider perturbations around this solution with 
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R = Rds + 5R then we get to lowest order that 



§W(i + § 4 

H ds V H d, 

Comparing this with the propagation equation of a massive scalar field gives 

IdS R dS „ (g7[g 

2 6/i 4 ~ ' 6/z 



p: , DSR + 1 + )5R = 0. (367) 



™ 2 = - ft c - -10 6 GeV, (368) 



where in the last equality the density has been taken to be that of water, p ~ 10 3 kg/m 3 , 
and fi has been taken to be ~ 10~ 33 eV, as required to account for the observed late- 
time accelerating expansion of the Universe. This large negative mass corresponds to 
a catastrophic instability that should make itself apparent on time scales of ~ 10~ 26 
seconds. 

For more general theories it can be shown that the effective mass squared in the 
relevant scalar field equation takes the same sign as Jrr ]485] . It is therefore the negative 



value of this quantity in the theory of Eq. ( 353 1 that is responsible for its exhibition 
of this instability. Further, one can show that the addition of higher powers of R to 
the gravitational Lagrangian again helps defend it from instability [968 , 421,(970]. The 
Dolgov-Kawasaki instability has been shown not to occur in the Palatini approach |1165j . 
The existence of this type of instability was rediscovered in |706j . for relativistic stars. 
The problem in this context has been further studied in [3T71 [7J IH3T1 WM\ 17071 I5UT] . 
where it was shown that the instability can be avoided by changing the equation of state 
of the star, adding a divergence to the scalar field potential, or including chameleon 
effects. Neutron stars in f(R) gravity have been studied in [3321 11095] . and instabilities 
in systems with time-dependent mass have been studied in [60] . 

4-. 2. General combinations of Ricci and Riemann curvature. 

In the previous section we considered theories that generalise the Einstein-Hilbert 
action by replacing the Ricci scalar, R, with some non-linear function, f(R). Here we 
go further, and allow the action to be a function of not only R, but of any of the three 
linear and quadratic contractions of the Riemann curvature tensoip'j R, R^R^" and 
R^upaR^ pa ■ A systematic approach to studying theories of this type, based on minimal 
sets of curvature invariants, is proposed and studied in 646, 906J. 

4-2.1. Action and field equations 

The most general weight-zero scalar density that one can construct from g, R, R^ V R^ U 
and RpvpoR^ 9 ' 1 alone is given by 

C - x^V^gf(R, R^R^,Rpu P aR^ pa ) (369) 

where / is an arbitrary function of its arguments, and x is a constant which can be 
determined from the Newtonian limit. The action is obtained, as usual, by integrating 



26 There is also a fourth possibility, namely ,pa R eT M „ R eT prT |418l . This contraction, however, is of 
limited physical interest as it does not affect the field equations, due to its parity. 

123 



this density, together with that of the matter fields, over all space-time. The addition of 



supplementary terms to the density (3691, in order to cancel total divergences, and which 



can be transformed to integrals on the boundary, can be problematic (see e.g. |849| ) and 
so, for simplicity, they are usually assumed to vanish. 

As with f(R) theories one can proceed by using the metric variation approach (in 
which the connection is a priori taken to be given by the Christoffel symbols), using 
the Palatini formalism (in which the metric and connection are taken to be independent 
fields in the gravitational action), or the metric- affine formalism (in which the metric 
and connection are taken to be independent fields throughout the entire action). Here 
we will spell out the metric variational approach explicitly, as this is the most commonly 
considered form of the theory found in the literature. For studies involving the Palatini 
procedure, as applied to the these theories, the reader is referred to [7921 14771 17941 11731 
17551 HPD21HP70] . 



Varying the action, derived from integrating Eq. (369) over all space, with respect to 
the metric, then gives 
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L2 



fg 



P V f 



f x 5X + f Y SY + f z 6Z 



-2f Y {R p(tl R u) 



-2fz(R P J"R« )tap Sg^ - R u vp °6ir 



(370) 



Vp<7 J 



where we have defined X 



R,Y = R^W and Z = R pvp<y R pvpa . Using 6R mu „ p(T = 
5gt P -ua - 5g tv -pa + 5g pv - ea ) we can then write the grav- 



itational part of the action as 
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dVLyf^gP^ 



(371) 



where 



f x R pv + 2f Y R p ^R v) , 



-2[f Y R p ^)»\ - A(f z R^ p 



VzR 



pac 



g^{f Y R p ")., po 



(372) 



The notation /jy here denotes the functional derivative of / with respect to N. Looking 
for a stationary point of this action, by setting the first variation to zero, then gives the 
field equations 

- g^A (373) 



P Xrji 

r p.v — 2 t*" 



where matter fields and a cosmological constant have been included. Here, A is the cos- 
mological constant (defined independent of f(X, Y, Z)) and T^ v is the energy-momentum 
tensor of the matter fields. These field equations are generically of fourth-order, with the 
exception of cases in which the function / is linear in second derivatives of the metric 
as occurs in GR. 

124 



Unlike the case of f(R) theories of gravity, the theories described by the density given 
in Eq. (370) are not, in general, conformally related to General Relativity with a scalar 
field. 



4-2.2. Weak- field limit 

Let us consider the weak field limit of theories with additional terms in their action 
that are quadratic in curvature invariants: 



C = 



R 



X 



yR 2 + pR^W + iR^paR^' 



(374) 



where a, j3 and 7 are constants. In this case one can use the well known result that the 
Gauss-Bonnet combination of curvature invariants is a total divergence, i.e. 

AR^R^ - R 2 - R ilvpa R ihVpa = total divergence, (375) 

which in the action integrates to a boundary term that is usually ignored. By redefining 
a and j3 we can therefore write Eq. ( 374 ) in the equivalent form 

C = ^ (R + aR 2 + fiR^R^) , 
X 



(376) 



without any loss of generality in the solutions to the resulting field equations. 

If we now substitute Eq. (376) into Eq. (373), to get the field equations, we find that 
for the perturbed metric 

ds 2 = -(1 + 2%lj)dt 2 + (1 - 2ct)){dx 2 + dy 2 + dz 2 ), (377) 

the lowest order equations in the weak-field and slow-motion limit are 

(378) 

(4a + /3)AR- R-2A(i) + j3Ai()) = ~xp, (379) 

where the Ricci scalar is given as usual by R = —2Aip + 4A<fr. For a delta function 
source, p = mS, Eqs. (378) and (379) can then be seen to have the solutions 

-mir 



2(3a + (5)AR - R = -|p 



where 



167T0 
167T0 



ml 



r 

X m 
r 



4e~ 



I ; — — 1 (380) 

I - — — — ! • (381) 

2i^W) ^ ml = "p- (382) 

These are the solutions found by Stelle in 1978 [1187] . These solutions can be seen to 
reduce to Eqs. (308) and (309) in the limit (3 — s- 0_. More generally, however, these 
theories can be seen to exhibit massive modes with two different mass parameters. In 
order to have non-oscillatory behaviour in the present case we must therefore require 
that both 3a + (i > and j3 < be simultaneously satisfied. 
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If the solutions given in Eqs. (380) and (381 ) are the correct ones for describing the 



space-time geometry around approximately isolated masses, such as the Sun, then one 
can immediately see that if mi and mi are both large compared to 1/r then one recovers 
the general relativistic prediction of 7 = 1, just as with f(R) theories. For small masses, 
however, the situation is somewhat different from the f(R) case. If both mi and m.2 are 



small compared to 1/r then one has that the leading order term in Eqs. (380) and (381) 
is a constant (which can be absorbed into coordinate redefinitions), followed by a term 
proportional to r. This is a considerable deviation from the behaviour 7 — > 1/2 that 
occurs when m\r is small and m-iT is large, which is the limit of f(R) gravity with a 
low mass parameter. It can also be seen that for mi < 2m-2 gravity is always attractive, 
while for mi > 2m2 it is attractive over large distances, while being repellent over small 
distances. 

4-2.3. Exact solutions, and general behaviour 

Let us now discuss what is known about the solutions to these general fourth-order 
theories of gravity in the context of both isolated masses, and cosmological solutions. 

Isolated Masses 

Motivation for a number of studies in this area has come from Einstein's particle pro- 
gramme, in which one looks for asymptotically flat and singularity free vacuum solutions 
which could be used to model particles [466] . While it is known that no such solutions 
exist in General Relativity (Lichnerowicz's theorem |797j ). it has been conjectured that 
they could exist in fourth-order theories |1252) . 



By studying the solutions of quadratic theories of the type given in Eq. (376) with 



(3 = —3a it has been shown that the solutions to the linearised vacuum field equations 



can be both asymptotically flat as r — > 00, and smooth as r — ► [503] . These theories 
are equivalent to the sum of an Einstein-Hilbert term and a Weyl term. Such results 
would initially appear to be encouraging for Einstein's programme, but it was later 
shown that there are, in fact, no solutions with the specified properties that exist within 
a neighbourhood of Minkowski space |1116j . This means that if any non-trivial static 
spherically symmetric vacuum solutions to these theories exist, that are simultaneously 
asymptotically flat and geodesically complete, then they must correspond to very large 
energy densities (exceeding the energy density of neutron stars by at least 40 orders of 
magnitude [1116] ). 

The theorems of Lichnerowicz [797 and Israel [650) have more recently been consid- 



ered in the context of fourth-order theories of the form given in Eq. ( 376 1 by Nelson 



[950] . Here it is found that for static space-times with spatial curvature satisfying 

m\- (3) i? > (383) 
R^R^ml + R^R^R^ > 0, (384) 

the vacuum field equations imply that all asymptotically constant solutions (or asymp- 
totically flat, if the inequalities are saturated) obey R^ = 0. The expression for mi is 



given in Eq. (382). Over-bars here denote quantities projected into space-like hyper- 
surfaces. The spherically symmetric solution to R^ v = is, of course, the Schwarzschild 
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solution, which is geodesically complete only for the case of Minkowski space. Lichnerow- 



icz's theorem can therefore be extended to all theories that obey the inequalities (383) 



and ( 384 1 . It is then shown in |950) that if the spatial curvature satisfies 



m 2 



> -m 2 , 



(385) 
(386) 



where m,2 is given by Eq. (382), and the space-time is asymptotically constant (or 
asymptotically flat, if the inequalities are saturated), then the only solutions with m\ > 
that exist in the region exterior to a closed spherical null surface also obey i? M „ = 0. The 
only asymptotically constant vacuum solutions with a horizon, that satisfy the bounds 



(385) and (386), are therefore the Schwarzschild solutions. This extends the Israel's no- 



hair theorem for black holes to quadratic fourth-order theories of gravity. It is argued in 



|950j that the inequalities (383 1- (386) should be satisfied everywhere where the spatial 
curvature is smaller than the scale of the corrections to the Einstein-Hilbert action. If 
these corrections are motivated by quantum considerations, then we should therefore 



expect all of the inequalities (383)-(386) to be satisfied for astrophysically interesting 
systems. The stability of Schwarzschild black holes in the quadratic theories (376) has 
been studied in [P27il ITT86] . 



The initial value problem for quadratic theories, of the type given in Eq. (376), has 
also been studied in |967j . where it was found to be well-posed. 

Cosmological Solutions 

There are a number of exact cosmological solutions known to exist for fourth-order 
theories containing R^R^ and Rp, V poR tluf " J ■ This simplest of these is, of course, de 



Sitter space, which exists for theories with general f(X,Y,Z) in Eq. (369), and a cos- 
mological constant, if 



1 



/ - A = Af x + 2A 2 /y + .A 2 /, 



(387) 



where Jn denotes differentiation of / with respect to N. The stability of de Sitter space 



in quadratic theories, of the type given in Eq. (376 ), has been studied in |102j . and in the 



more general case in |325j . Other known exact homogeneous and isotropic cosmological 
solutions are the Einstein static universe and the Godel universe, the existence of which 
has been discussed in [309 for arbitrary ,f(X, Y, Z) (together with the conditions for 
the existence of closed time-like curves in the case of latter). The existence of power- 
law FLRW solutions, both in vacuum and in the presence of a perfect fluid, has been 
discussed by Middleton in [887J . Power-law scaling FLRW solutions for theories with 
L = R + a^jR 2 - AR^R^ + R^paR^^ have been investigated in [T2"3"3"l . The extent 
to which the FLRW solutions of General Relativity can be reproduced in these theories 
is discussed in [84811949]. 

As well as isotropic cosmological solutions, a number of studies have also been per- 
formed of anisotropic cosmological solutions in these theories. The simplest of these 
are probably the Bianchi / Kasner-like exact solutions found in |311| . which were used 
to show that the infinite sequence of anisotropic oscillations that occurs on approach 
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to the initial singularity in General Relativity does not occur in higher-order gravity 
theories, except in unphysical situations. This type of solution was further studied in 
[557]. Exact Bianchi type 77 and Vlh solutions were found by Barrow and Hervik in 
for quadratic theories of the type ( 376 1 , and were used to show the lack of valid- 



ity of the cosmic no-hair theorems in these theories: Anisotropic inflation with positive 
A is possible, without de Sitter space as the late-time asymptote. These authors also 
considered the general behaviour of Bianchi type I and II solutions in quadratic theo- 
ries, where the possibility of a stable isotropic singularity was discovered |103j . Bianchi 
type I, IV, Vlh and Vllh universes have been studied in [104] . where it was shown 
that periods of anisotropic expansion can occur after a near isotropic expansion, and 
before re-isotropisation at late-times. Bianchi type VII a solutions have been studied for 
quadratic theories in [367] , and Bianchi type IX universes have been studied by Cotsakis 
et al. in [340] . where the Kasner solution of General Relativity was shown not to be a 
stable early asymptote of the quadratic theories given in Eq. (376). 

Stability of past isotropic attractors has been the subject of study by Barrow and 
Middleton |107[ 1888] , In the first of these papers the authors demonstrated the stability 
of past isotropic solutions to the quadratic theories ( 376 ) under scalar, vector and tensor 
inhomogeneous perturbations [107] , This supports the hypothesis that small pertur- 
bations to the past isotropic attractor form part of the general cosmological solution to 
quadratic theories of fourth-order gravity. This study is extended to theories with power- 
law curvature terms, (R fJjU R fl ") n , in their Lagrangian in [888], where conditions are given 
for the stability of early isotropic states. This study also shows the instability of the 
exact solution found in [311] . as the initial singularity is approached. The asymptotic 
behaviour of theories with quadratic corrections to the Einstein-Hilbert were studied, 
in the context of string cosmology, in [1090, 236:, 11218] , Exponential and power-law 
FLRW solutions in higher-dimensional string inspired models are found in [8551 18561 IT7] . 
The evolution of FLRW solutions in generalised theories has also been studied using a 
dynamical systems analysis in 



4-2.4- Physical cosmology and dark energy 

Having discussed various cosmological solutions in these general fourth-order theo- 
ries, let us now consider their relevance for observational cosmology and dark energy. 
We will proceed with this by first discussing studies of more general theories, followed 
by theories constructed from the Gauss-Bonnet curvature invariant. The Gauss-Bonnet 
invariant has special properties, which we will discuss in the Section [4. 2.5| 



General Theories 



In order to construct cosmological models that can produce late-time accelerating 
expansion the authors of [254] considered theories of the type 

4n+2 

L = R+- — „ - — , (388) 

where \i, n, a, b and c are constants. It is found that for these theories there exist 
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power-law attractors for the general spatially flat FLRW solutions, which are given by 



2 + 10n + 2-3a±Vr 

'<!/! = «,! (^) • (*«» 



where 

12a + 46 + 4c 



(390) 

12a + 36 + 2c v ' 

T = 9n 2 a 2 - (80n 3 + 116n 2 +40n + 4)a 

+64n 4 + 160n 3 + 132n 2 + 40n + 4. (391) 



The smaller of the exponents in Eq. (389) can be seen to — > as n — > oo, while the 



larger tends to An. For large n it is therefore the case that accelerating expansion can 
occur at late times. This is a generalisation of the type of model considered in |255) . for 



f(R) gravity. In [880] it is shown that while the theory given in Eq. (388) is capable 
of explaining the supernova results, to do so and still have an acceptable age for the 
Universe it requires the matter content of the Universe to have an equation of state 
0.07 < w < 0.21, to 2a. The FLRW solutions of theories with powers of R, R^R"" and 



R/j,vpcrR flvpc ' added to the Einstein-Hilbert action were also studied in [455j . where the 
possibility of late-time accelerating expansion was considered. 

Primordial nucleosynthesis in theories with powers of R^R^ V added to the Einstein- 
Hilbert action have been considered in |957) . where constraints from observed element 
abundances are imposed. As with scalar-tensor theories, the constraints imposed from big 
bang nucleosynthesis are largely due to the different expansion rate during the radiation 
dominated period due to a different value of the effective Newton's constant. 

The addition of a conformally invariant term to the Einstein-Hilbert action has been 
considered in [9531 19541 19511 1952) . In this case the gravitational Lagrangian takes the 



form given in Eq. (376) with 3a + f3 = 0, and the resulting field equations are some- 
times known as the 'Bach-Einstein equations'. The solutions to these equations have 
been studied in the context of inflation |953j , the evolution of background cosmological 
models [954] . the observational constraints available from pulsars [951] . and weak fields 
and gravitational waves |952) . Theories of this type are motivated, in part, from non- 
commutative geometry |268j . For a review on short scale modifications of gravity in the 
context of non-commutative geometry, see [961] . 

Theories with L = f(R, R 2 - AR^R^ + R^paR^ 1 " 7 ) 

Theories that are functions of the Ricci scalar, R, and the Gauss-Bonnet combination, 

G = R 2 -AR^R^ + R^ pa R^ pa , (392) 

have been particularly well studied, as they arc motivated by string theory [8861 15301 15291 



|, and have improved stability properties (as will be discussed in Section 4.2.5). The 
linear case of / = R+G is known to be equivalent to the Einstein-Hilbert Lagrangian in 4 
dimensions, up to surface terms, but more general functions, of the type f(R, G), produce 
field equations that differ from those of General Relativity. The mathematical properties 
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of the Gauss-Bonnet tensor, that occurs from varying the action of these theories, as well 
as the more general Lovelock tensor, have been studied in [4931 14911 14921 1494] . 

The general behaviour of spatially flat FLRW solution in theories with L = R + /(G) 
has been studied by Zhou, Copeland and Saffin in |1309) using a phase plane analysis. 
In this case the Friedmann equations become 

3H 2 = Gf d -f-2AH 3 f d + ^p (393) 
2H = 8H 3 f d -16HHf d -8H 2 f 6 + ^(p + P), (394) 

where R = 6(i? + 2H 2 ) and G = 2AH 2 (H + H 2 ). The existence of both stable de Sitter 
space, and phantom-like accelerating solutions to the above equations can be demon- 
strated, as well as trajectories in the phase space that mimic the evolution of the standard 
ACDM universe through radiation and matter dominated periods [1309 . The stability of 
de Sitter space, as well as radiation and matter dominated epochs has also been studied 
by de Felice and Tsujikawa in |371j . where the conditions Jqq > and fgg — > 0+ as 
|G| —> oo were found to be required for models to be viable. These authors suggest the 
following functional forms for /(G) as examples that satisfy these conditions, and could 
produce acceptable expansion histories for the Universe: 

A 
" 2 



G* In ^1 + ^-J - a\\J G* (395) 
-a\\/G*, (396) 



where a, A and G* are constants. It is further claimed that these forms of /(G) are com- 
patible with solar system observations [372] . producing corrections to the Schwarzschild 
metric that are of the form ~ H 2 r 2 {r /r s ) p , where r s is the Schwarzschild radius of the 
Sun, H is the Hubble rate and p is a model dependent quantity. Much larger correction 
to General Relativistic predictions are claimed in |364| for theories with polynomial addi- 
tions of the Gauss-Bonnet term, G™, to the Einstein-Hilbert action. The cosmologies of 
these theories, and theories with inverse powers of aG+/3R added to the Einstein-Hilbert 
action (where a and f3 are constants), have been considered in |3911l979j . while the FLRW 
solutions of other R+f(G) theories have been considered in [971, 323J . The 'inverse prob- 
lem', of finding FLRW solutions that behave like ACDM has been considered for R+f(G) 
theories in |324J. The phase space of FLRW solutions to L = f(R, G) theories, and the 
transition from deceleration to acceleration, has also been studied in [25, 26, 24] . Super- 
nova, BAO and CMB observations have been used to constrain L = R + /(G) theories 
in [SD7j. 

Linear perturbations around spatially flat FLRW universes have been studied in L = 
f(R, G) theories by de Felice, Gerard and Suyama in [368) . and in L = R + /(G) theories 
in particular by Li, Barrow and Mota |791j . The former of these studies uses the velocity 
potentials and variational principle approach of Schutz 1119! , while the latter uses the 
covariant formulation of Ellis and Bruni [469 . In the case of general f(R, G) it is found 
that scalar perturbations in these theories have, in general, six degrees of freedom, two 
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of which propagate on small scales with group velocity [368] 



3 (8f Rd H* ■■ Ki// ; / ( ,, ; • .//,.,,.;;./,,. • !//./,, 



There can be seen to be a /c-dependence in Eq. (397), which does not occur in General 
Relativity. Such a relation had previously been found for the vacuum case in |393| . 
where it was argued that the space-time is unstable if v 2 g < 0, or has super-luminal 
modes in short wavelength modes if Vg > 0. These features are problematic, but can be 
avoided in theories that satisfy JrrJqq — f RG = 0. Such theories have scale independent 

propagation speeds only, and include the cases of L = f(R), f(G), G+f(R) and R+f(G). 
The latter case, being the subject of study in |791j . has also been shown to suffer from 
matter instabilities. This is due to the evolution equation for perturbations to Jq, which 
we write as e, and which obeys [791] 



AO 




27(3 - AfG9) 



GG 



e = S, (398) 



where 9 = 3H is the expansion scalar, and the reader is referred to [791] for the form of 
the source term S. For stability it is required that the third term in the square brackets 
be positive, and remain dominant over the first, which is expected to be negative during 
matter domination. This requires > 0, and for /^ff 6 to remain suitably small, in 
order to avoid instabilities [79111370] , These are strong constraints on the forms of f(G). 
It was further shown in [368] that vector modes in the general case of f(R, G) decay, and 
that the propagation of tensor modes in these theories is model dependent. 

4-2.5. Other topics 

Let us now consider some remaining topics in fourth-order gravity, that have yet to 
be discussed. 

Theories with L = f{R, 0, R 2 - AR^R^ + R iiVpa R ,l ' Jpa ) 

There has been some study of fourth-order theories that include a scalar field, as 
well as the Ricci scalar and the Gauss-Bonnet scalar, in the gravitational action. This 
is motivated by the dilaton that arises in string theory compactifications |886j . and has 
been studied in terms of 'pre-big bang' cosmology in [530, 529] 1264] . In this scenario 
there is an early period of very rapid expansion due to the kinetic term of the scalar 
field. Black holes in these theories, and their extensions, have been studied in [890[ 
15591 15751 H3 IT2091 17001 12231 I25T1 15751 1571 15M1 IM51 15571 13551 13251 15M1 [555]. Further 
string motivated study of FLRW cosmology in the context of these theories has also been 
performed in [5J [T2T71 H^Hl WZA[ [575] . 

Late-time acceleration has also been studied in theories where a scalar field has been 
included, along with R and G, in the gravitational action (9821 [960] [9591 EH WM 17291 
[12231 [TU971 [Ml GUI IM3 ISO]. These papers have considered the evolution of FLRW 
space-times, as well as inflation, structure formation, and the constraints that can be 
imposed upon them from supernovae, CMB, BAO, solar system observations, and pri- 
mordial nucleosynthesis. These observations place strong constraints on the theories. 
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Greater-than- fourth- order Theories 



Another option that has been considered in the literature is that the action itself 
could contain derivatives of curvature invariants, so that [206J 



L = f(R,OR, □ 2 i?,...,D"i?), 



(399) 



where □ is the D'Alembertian. Extremising the action associated with this Lagrangian, 
by varying the metric, gives the field equations [1117] 



1 x 

yR^v ~ ^fg^v - y^v + g^oy + x^ v = - t^, 



(400) 



where 



y 



E 



i=0 



Of 



3(0*11) ' 
df 



3=1 



d(n>R)' 



The field equations ( 400 ) can be seen to generically contain derivatives of the metric of 



order 2n + 4, so that the familiar fourth-order theories discussed above are recovered 
when n = 0. Theories with infinite n have also been considered in the literature, and 
have been claimed to be ghost-free [1641 1163] . 

Greater-than- fourth-order theories can be shown to be equivalent, under a conformal 
transformation, to General Relativity with two scalar fields [550 , and their Newtonian 
limit has been considered in [657] , where it was found that the familiar form of Newto- 
nian potentials and Yukawa potentials can be present. Their consequences for inflation 
have been studied in jH31 QH EH HH [SHI ESS], and the attractor nature of de 
Sitter space established. Bouncing cosmologies in these theories have been considered 
in [1641 11621 1163] , and the form of the CMB has been investigated in [162] . Primordial 
nucleosynthesis has been considered in [957] , and the consequences of this type of theory 
for dark energy have been considered in [977] ■ For a more detailed overview of theories 
with greater than four derivatives of the metric in their field equations the reader is 
referred to pTT5] , 

Conformal Gravity 

One more possibility is to completely abandon the Einstein-Hilbert action, even as 
a limiting case of the fundamental action. Such a proposal has been advocated by 
Mannheim [866] in the case of conformal Gravity. Here the Einstein-Hilbert action is 
replaced by 



Sc = -G!G 
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(401) 



where C\^ K is the Weyl tensor, given by 



—-^isxvRfiK — gxuRfiv — g^uRxn + g^Rxv], 

and «g is a dimensionless gravitational self-coupling constant. It has been shown that 
such an action can be obtained from the path integral of fermionic degrees of freedom 
for the conformal and gauge invariant action of a fermionic field. 

The case has been made that such a theory has a number of desirable properties 
eluding other higher derivative theories of gravity. Even though the equations are fourth 
order, signalling the presence of negative energy states, it has been shown that such states 
are completely decoupled from what the authors dub the physical sector |865j . Conformal 
gravity can then be held up as a viable theory of quantum gravity. Furthermore, the 
peculiar ultraviolet properties of conformal gravity have been argued to lead to a solution 
to the cosmological constant problem. 

At a classical level, conformal gravity has been shown to have intriguing properties. 
For a start, the non-relativistic limit of the field equations leads to a fourth order differen- 
tial equation for the gravitational potential, in which the usual Newtonian potentials 
that drop off as 1/r are but one possibility. The general weak gravity potential is of the 
form 

j\ 

$(r) = V Br. 

r 

Such a form, it has been argued, can be tuned to fit a range of galaxy rotation curves. As 
in some other theories of modified gravity, this is achieved by fixing universal parameters. 
This may be contrasted with the usual dark matter prescription in which, for each galaxy, 
one can choose the properties of the dark matter halo. 

The situation becomes more complicated once one adds couplings to matter fields 
[864]. Conformal symmetry is spontaneously broken through a new scalar degree of 
freedom, S, such that the general (conformal) matter action is 

hi = -hj d^V^i^S^ - ^S 2 R% + AS 4 + i^V^ - gS^}, 

where -0 is a fermionic field. One can extend this to more general actions containing 
scalar and fermionic fields. The vacuum expectation value of the scalar field is what 
then sets the Gravitational constant and the coupling to matter. It can also be used to 
renormalise the cosmological constant. 

Although some of the quantum properties of conformal gravity have been worked out, 
a fully consistent and complete analysis of their cosmology is still lacking. In particular, 
and in its current incarnation, in which no dark matter is invoked, it is unclear how the 
correct angular diameter distance for the CMB can be obtained. 

Theories with L = f(T) 

An interesting variant on generalisations of the Einstein-Hilbert action are the L = 
f(T) theories, where T is a contraction of the torsion tensor (defined below). These 

133 



theories generalise the 'teleparallel' approach to General Relativity, which corresponds 
to a Lagrangian L = T, from which Einstein's equations can be derived [1255]. Here T 
is defined by 

T = -T^PT^ p + ^T^"T pvp - T^T"" p , (402) 

where T^ v is the torsion tensor, defined in terms of the vierbein from g^„ = T] a ph"h^, 
as 

T% = K (d»h a p - d P K) . (403) 

This definition is equivalent to setting T^ vp equal to the antisymmetric part of the 
Wcitzcnbock connection. Now, varying the action 

9 f(T)dn (404) 



16ttG 

with respect to the vierbein fields gives the field equations 

h [r,u - \g^R\ + \ g>lv (/ - f t T) (405) 

where we have called the energy-momentum tensor 0^„ , to distinguish it from the torsion 



tensor. It can be seen that in the case f(T) = T the field equations (405) reduce to 
Einstein's equations, so that the theory L — T is equivalent to the Einstein-Hilbert 
action, as stated above. For f ^ T, however, the teleparallel approach outlined here 
gives different field equations to the fourth-order theories we have so far considered. 

It was within the framework of these generalised equations that Bengochea and Fer- 
raro suggested that the late-time accelerating expansion of the Universe could be ac- 
counted for without dark energy |135j . These authors considered the particular case 



/ = T -(=Tp (4 ° 6) 

where a and n are constants, and constrained the resulting FLRW cosmology they found 
with supernovae, BAOs and the CMB. They found the best fitting model has n = — 0.10, 
= 0.27, and has the required radiation, matter and accelerating epochs. A large 
number of papers have followed |I35j in a short space of time, exploring the transition 
from deceleration to acceleration, observational constraints, conformal transformations, 
and structure formation [5J32 fT^TT 15521 IT2551 IT22T1 [25J HM fT2551 [75J [H5J USD EMI 
Ml 

It has been shown, however, that these theories do not respect local Lorentz invari- 
ance, and have a number of extra degrees of freedom that are not present in General 
Relativity as a result [796] . This can seen by noticing that one can write the Ricci scalar 
in terms of T^ vc as 

R = -T- 2V^ (T u ^) . (407) 

Now, while R is of course a Lorentz scalar, one can show that V M (T"^) is not. It there- 
fore follows that T is not a Lorentz scalar either, and so the f(T) theories do not exhibit 
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local Lorentz symmetry. The exceptional case is / = T, in which case the non-Lorentz 
invariant part of the action can be seen from Eq. ( 407 ) to be a total divergence, which 
does not affect the field equations. 



Stability Issues 



There are serious concerns with the stability of general theories of the type L = 
f(R, RpvRf 1 ", RfiupaR^^)- Not least of these is the presence of 'ghosts', or perturbative 
modes with negative norm, as well as tachyonic instabilities in massive modes [1187, 236, 



\mmMnmttMR[m%\Tmm()\ . 

Let us now outline how ghost terms arise in these theories, following the discussion 
of |606l 16071 1293] . This starts by considering the quadratic theory 



R 



1 



6?71q 



R z - 



1 



1m\ 



(408) 



where m and TO2 are constants, and C 2 = C i i vpaC p,upa 



D TDpvpu 
1 ^pvpo 1 L 



2R uv R fiU + R 2 



is the square of the Wcyl tensor. Any quadratic theory can be written in this form, up 
to boundary terms, because of the Gauss-Bonnet identity. We now want to identify the 
scalar and spin- two degrees of freedom in Eq. (408 ), for which it is convenient to introduce 



auxiliary fields that play these roles, and to transform so that they have canonical form. 
We will do this now, following [606]. By introducing an auxiliary scalar field, ip, and 



performing a conformal transformation g pv = e v g^ v , Eq. ( 408 ) can be rewritten as [606] 



C = yj-g 



R- 



3m 2 , 



C 2 
2m\ 



(409) 



By extremising this equation with respect to ip, and substituting the resulting value of < 



back into the Lagrangian density, one recovers Eq. (408). It can be seen from Eq. (409) 



that ip now has the kinetic term of a canonical scalar field. By introducing a second 
auxiliary field, ir^, and transforming the metric so that g^ u = g pp A p v j y/\A\, where 



A/ = (1 + \4>)8 P V - </>/, we can then rewrite Eq. (4091 as [606 



C 



3m 2 , 



2 - (^(T 



(410) 



(411) 



where in the second equality we have expanded out to quadratic order in ip and 
around zero, so that we are considering theories that are close to GR. The field <f>" = 
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has been introduced here to make clear with respect to which metric the indices are being 



raised or lowered: those of ir„ v are raised and lowered with 



9» v 



while those of 



raised and lowered with g^. The quantities C M are defined as 



C7", 



vp 



(412) 



The Lagrangian given in Eq. (411) now has scalar, <p, and spin-2 modes, 0„ y , both in 
canonical form. It can be seen from Eq. (411) that for real (ft" the spin-2 field does 



indeed have the wrong sign before its kinetic term, and is therefore generically a ghost, 
while for real tp the scalar mode is not |1270j . What is more, if < or m\ < then 
the scalar or spin-2 modes exhibit tachyonic instabilities, respectively. 

Having outlined the proof for the generic existence of spin-2 ghosts in quadratic 



fourth-order theories of gravity, ( 408 1 , let us now extend this to more general theories of 
the form 

C = y/=gf(X,Y,Z), (413) 



where X = R, Y = R^R^ and Z = R tlupa R ilv( " J . This demonstration proceeds by 
showing that the particle content of theories of the type (413) are the same as the 



quadratic theories (408) (at least, when considering fluctuations around de Sitter space), 
and then using the result derived above, that these theories generically contain spin-2 
ghosts. The first step here is to introduce auxiliary fields <fii, (f>2 and (f>3 so that the 



Lagrangian density (413) becomes [607, 293 



C = V=g[f + h(X - fa) + h{Y - <h) + h(Z - h)} , 



(414) 



where fa = df/dfa, and i = {1,2,3}. As long as is non-degenerate, extremising with 
respect to the these three new fields then gives 4>\ = X, <f>2 = Y and <p^ = Z, so that 
Eq. (|413[) is recovered. Using the Gauss-Bonnet combination, and discarding boundary 



(415) 



terms, this equation can then be written as [293 
C 



V^g[(f - 4>ih - 02/2 - foh ) + hR + g(/i + h)R 2 

+ \(h+±h)CpvpoC^e° 

-~(/a + 2/ 3 ) [C^C^P* - 2R„ V R^ + ^R 2 



where C^pa is the Weyl tensor, and the last quantity in brackets is the Gauss-Bonnet 
combination. At this point the theory has been shown to be equal to a scalar-scalar- 
scalar-tensor theory, with the three scalars non-minimally coupled to quadratic curvature 
invariants. Finally, we can choose to expand the density above up to second order in 
fluctuations around a de Sitter background with constant Ricci curvature, Rq, so that 
[513] 
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C 2 



(416) 
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where boundary terms have been ignored, and we have defined 



A = 



fx + X 2 (-fxx - -jjy 



(/- 

+XH l -f XY + \fxz) + X\ l -f YY + ^f zz 

o 



Xfr 



2 

— ^ l/xy + g/jfz) 

;^iw + ^/zz + \.fv 

V + 2/ z 



gJVz))( 



'0 



2/y 



g/yz))o 

+ Vxz) 

3 i 

(^Iyy + g/zz + /yz))o 

m 2 " 2 = -(/y+4/ z > , 

where (• • ■ )o denotes the value of the quantity inside the brackets on the de Sitter back- 
ground. It should be clear that Eq. (416) is identical to Eq. (4081, up to the values 
of a and A. The particle content of general theories of the type (413), on a de Sitter 



background, therefore also has a scalar mode with mass too, and a ghost-like spin-2 mode 
with mass TO2. 

As mentioned in Section 4.2.4[ it has been suggested that theories that are only 



functions of the Ricci scalar, R, and the Gauss-Bonnet combination, G = R 



-&R IXV R> 1V - 



R^vpaR llupa j can evade the ghost problem outlined above |944) . The reason given for this 
is that the mass term m^ 2 — > as jy — > 4fz, a condition that is satisfied for theories of 
the type L = f(X, Z — 4Y), or, equivalently, L = f(R, G). When m^ 2 vanishes it can 



be seen from Eq. (416) that the term responsible for the ghost spin-2 fluctuations will 



also disappear. Further requirements for the non-existence of ghosts in f(R, G) theories 
are discussed in [392], with particular reference to the model of [254] . Such theories 
may still be subjected to constraints on their parameters by the possible existence of 
tachyonic instabilities, if to 2 , < 0. For theories with L = f(R, G) it can be seen that 
m Q 2 = \R 2 fGGi so that the condition TOq > is equivalent to the stability condition 
Jgg > found in 371 , 370 s and |791j in the context of cosmology. 

4-. 3. Hofava-Lifschitz Gravity 

Hofava-Lifschitz (HL) gravity was proposed as a toy model of quantum gravity [618[ 
619, 620J. The model is non-relativistic and relies on anisotropic scaling between space 
and time in the UV to help render the theory asymptotically safe. Furthermore, it 
was claimed that General Relativity could be recovered in the infra-red by including 
additional relevant operators. HL gravity in its various guises has been reviewed in a 
number of articles [TUT61 ITT661 [T266t 19261 [170] . 

To understand the idea behind HL gravity we must first understand why perturbative 
General Relativity is not UV complete. The non-renormalisability arises because the 
coupling constant has negative mass dimension, [G] = — 2, and the graviton propagator 
scales as 1/p 2 . Consider the following scalar field theory, 



S = 



1 

~2 
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(dip) 2 + \<p 6 



(417) 



Again, the propagator scales like 1/p 2 , and the coupling constant has mass dimension 
[A] = —2, so schematically at least, one might expect this theory to be non-renormalisable 
too. To render the theory asymptotically safe, we need to improve the UV behaviour of 
the propagator. One might do this by adding relativistic higher derivative terms to the 
action, but this is known to introduce an additional ghost-like degree of freedom. The 
reason for the existence of this ghost can be traced back to higher order time deriva- 
tives as opposed to space derivatives. This observation suggests an alternative approach. 
Let us abandon Lorentz invariance and introduce higher order spatial derivatives with- 
out introducing any higher order time derivatives. The former should improve the UV 
behaviour of the propagator, whereas the latter guarantees the absence of ghosts. We 
therefore modify the kinetic term 

- ^ - ^>(-A)V, (418) 

where A is the spatial Laplacian. We now have a non-relativistic dispersion relation 
w 2 oc k 2z , which means that time and space scale differently, 

x -> Ix, and t -> l x t, (419) 

For large enough z, it follows that the coupling constant has a non-negative scaling 
dimension, [A] — Az — 6 so we expect the theory to be power counting renormalisable, 
and ghost-free. On the flip side, we have broken Lorentz invariance, which is well tested 
at low energies. However, we can cope with this by adding a relevant operator of the 
form C re i = ic 2 ipA(p. This leaves the good UV physics unaffected, but allows Lorentz 
invariance to be restored as an emergent symmetry in the IR, with an emergent speed of 
light c. 

In HL gravity, one applies similar logic to the relevant perturbative degrees of freedom, 
schematically replacing <j> with the graviton, hij. Since we will require time and space 
to scale differently in this model, we must first choose a preferred time, t, which in the 
language of General Relativity means making an ADM split [902] 

ds 2 = -N 2 c 2 dt 2 + q ij (dx i + N i dt)(dx j + N j dt), (420) 

where qij(x, t) is the spatial metric and N l (x, t) is the shift vector. For the lapse function 
we consider two separate scenarios: (i) the projectable case where the lapse N = N(t) is 
homogeneous and (ii) the non- projectable case where the lapse N = N(x,t) can depend 
on space. Having chosen a preferred time, we no longer have the full diffeomorphism 
group, Diff(A4), but a subset known as foliation preserving diffeomorphisms, Diff(A4, J 7 ), 
generated by 

St = f{t), and 5x { =C(x,t). (421) 
Diff (A4 , F) is defined by the following set of infinitesimal transformations 

6N = dtiN^+CdjN (422) 
6N l = + (423) 

hij = fdtQij+ C-niij- (424) 

Note that this hard breaking of diffeomorphism invariance is at the root of many of 
the problems facing HL gravity as it allows additional degrees of freedom to propagate 
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|2301 I278j . To see the extra degree of freedom emerge it is convenient to perform a 
Stuckelberg trick [T076 ., and artificially restore full diffeomorphism invariance at the 
expense of introducing a new field - the Stuckelberg field. This field becomes strongly 
coupled as the parameters of the low energy theory run towards their diff-invariant values 
[278] (see also p7l 17351 IT025] ). 

We can think of the lapse and shift as playing the role of gauge fields in Diff(A / i, J 7 ). 
It follows that the projectable case is the more natural since then the gauge fields have 
the same space-time dependence as the corresponding generators. Having said that one 
might expect it to be easier to match the non-projectable case to General Relativity in 
the infra-red. 

In any event, the action from these theories is built from objects that are covariant 
with respect to Diff(.M, J-). These are the spatial metric, and the extrinsic curvature, 

Kij = Qij — 2V(jATj) , where V, is the spatial covariant derivative. In the non- 
projectable case one should also consider terms built from eij = V, log AT [167] . To build 



the gravitational analogue of the action (417), we replace the kinetic term such that 



-> —y/qNiKtjlCi - XK 2 ), (425) 

2 K G 

where kg is the gravitational coupling with scaling dimension [kg] — z ~ 3, and A is 
a dimensionless parameter that also runs with scale. Clearly, for the z = 3 theory 
the gravitational coupling constant is dimensionless, which may lead one to suspect the 
theory to be power counting renormalisable. For z = 3 the leading order term in the UV 
part of the action becomes 

- ^(-A)V -> -kgVqNV 6 , (426) 

where the dimension six contribution to the potential is 

V 6 = pVkRij V k R ij + .... (427) 

Here /3 is a dimensionless parameter, Rij is the spatial Ricci tensor and ". . ." denotes 
any of the other possible dimension 6 operators that one might wish to include, e.g. 
R 3 ,RAR, (aX) 3 etc. 

Now let us consider the type of relevant operators one might add. If we demand our 
action to be invariant under spatial parity x l —> —x 1 and time reversal t — > —t, then we 
only need consider even dimensional operators, 

Crei = - — yfqN(V 2 + k g V 4 ). (428) 

At dimension four these are 

V 4 = AiR 2 + .1, />•,./."■' + + J Bi J Ra i a J + B 2 R l3 a l a j + B 3 Ra\ 

+ di^a,) 2 + C 2 (a l a i )a : > j + C 3 (a Z i) 2 + C 4 a ll a ij: (429) 



whereas at dimension two, 

V 2 = — a(a l Oi) — c R. 
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(430) 



Here we have introduced the notation a %l '" %n — V l1 • • -V*" log TV. Of course, terms of 
this form are only relevant in the non-projectable case. 
The full z = 3 gravitational theory is now given by 

S = I dtd 3 x —yfqN [K %3 K %] - \K 2 + c 2 R + a{a l ai ) 2 - k g V 4 - k%V 6 ] +S m , (431) 

where S m [N, N iy q^; is the matter part of the action, and Vg is the relevant dimension 
6 operator. Note that in the absence of full diffcomorphism invariance we do not require 
matter to satisfy energy-momentum conservation |278) . Indeed, in general we expect 
to see violation of energy conservation since Diff(A^) breaking operators in the gravity 
Lagrangian will induce Diff(A^) breaking quantum corrections to the matter Lagrangian 

Let us compare this to the Einstein-Hilbert action, written in terms of the ADM 
variables as 

~K i4 K ij -K 2 



R . (432) 



The claim is that A = 1 and a = are the infra-red fixed points of the renormalisation 
group flow. Of course, the parameter a plays no role in the projectable theory, since 
any terms containing a, = V, logiV will vanish for N = N(t). For both the projectable 
and non-projectable theories, the free parameters run to their infra-red fixed points at 



low energies, so that the HL action (431 1 tends towards the Einstein-Hilbert action (432) 
with an emergent speed of light, c, and an emergent Newton's constant, G — kqc 2 /I6ir. 

Before delving further into the different manifestations of HL gravity, let us pause 
to make a few general comments. The first of these is with regard to the large number 
of terms appearing in the potential. We have not bothered to present the contributions 
from dimension six operators since they are two numerous^] To reduce the number of 
terms, Hofava originally borrowed the notion of detailed balance from condensed matter 
theory |619j . but this has since been shown to lead to phenomenological problems 278, 
1169] [Tl70| . Of course, the large number of terms appearing in the potential is really 
only an aesthetic concern. 

A second, more serious, concern, involves the fine tuning of light cones for each field. 
Since Lorentz invariance is not exact there is no symmetry guaranteeing that all fields see 
the same emergent light cone. We would like there to be some mechanism suppressing 
Lorentz violating operators at low energies but preliminary investigations suggest that 
fine tuning is required [1045] . It is possible that supersymmetry may be able help with 
this to some extent |565j . 

Another issue that has yet to be fully explored concerns possible equivalence principle 
violations in HL gravity |692| . To see how these might arise it is convenient to go to the 
Stuckelberg picture. The Stuckelberg trick was developed in the context of massive gauge 
theories [1076j . but it has proven very useful in elucidating some of the key physics of 
HL gravity as well (see, for example, [278111671 1692] ). Recall that the anisotropic scaling 
of space and time requires a hard breaking of Diff(A^) down to Diff(A4, J 7 ). We can 



27 Note that the full set of inequivalent terms up to dimension six has been presented for the projectable 
ase in [TT691 ITTTOl . 
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artificially restore full diffeomorphism invariance by redefining the ADM slicing in terms 
of the Stuckelberg field cj)(x,t). That is, the slicing goes from 



t = constant 



(x, t) — constant. 



(433) 



The unit normal to the spatial surfaces is covariant under Diff(.M) and given by the space- 
time gradient n M = V^/y/— (V</>) 2 . We can now express HL gravity as a relativistic 
theory involving the space-time metric, g^ v , and the Stuckelberg field [167, 692, 532J. 
We do this by defining the space-time analogue of the spatial metric and the extrinsic 
curvature in terms of the projection tensor q^ = g^+n tl n ll and its Lie derivative \L n q jlv 
|532) . Violations of the EP can occur because the Stuckelberg field can mediate a force 
between matter fields carrying Stuckelberg "charge". As shown in |692j . Stuckelberg 
"charge" is a measure of violation of energy-momentum conservation, schematically given 
by 



T< Tiiv 



^0. 



(434) 



This is allowed by foliation preserving diffeomorphisms which simply require [692J 



ga,V M T^ = 0, 



and 



1 SS n 



V-9 H v/-(V0) 5 



(435) 



Even if the Stuckelberg charges, r^Ta, ... are always small, violation of the EP can 
still be large since the relevant Eotvos parameter -q ~ ri+r2 really only cares about the 
charge ratios. 



4-3.1. The pro jectable theory 

We now focus on the projectable version of HL gravity, for which the lapse function 
is homogeneous, N = N(t). The action is then given by 

S = ( dtd 3 x —y/q^N [KijK i3 - \K 2 -V]+S m , (436) 

where the potential V = —c 2 R+ higher derivative operators. Since the condition of 
projectability is imposed at the level of the theory itself, it follows that the Hamiltonian 
constraint is non-local: 

J d?x^-q [K^ XK 2 + V] = (437) 

In comparison with GR where the Hamiltonian constraint is local, this admits a much 
larger class of solutions. Indeed, it has been suggested that the resulting integration 
constant can account for dark matter |924j . although this may lead to the formation of 
caustics and the break down of the theory |167j . 



Dark matter as an integration constant 
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To see how this might emerge, we rewrite the action ( 436 ) in the following form 



c 4 



S -1^G / ***** 



— s + R 



2 r 

+ i 1 ~ X )tf~7; / dtdPxJqNK 2 + UV corrections. (438) 
lD7rG J 

Focusing on the low energy theory, the resulting field equations are |924j 
d z xJ~- 



G (4) _ 




= 0(1 


-A) 


(439) 


(off 


-^y 


= 0(1 


-A) 


(440) 




(4 ) 8ttG 


= 0(1 


-A), 


(441) 



where g^ v is the full space-time metric, G$ is the corresponding Einstein tensor, and 
n M = jj(l,—N l ) is the unit normal to hyper-surfaces of constant t. The stress energy 
tensor, T^ v , is not necessarily conserved, as previously stated. Note that the non-local 



Hamiltonian constraint, (439), and the local momentum constraint, (440), are preserved 



by the dynamical equations (441) 



Now, these equations can be rewritten as follows [924 : 

G $ = + T* L ) + 0(1 - A), (442) 

where T^ V L = p HL n^n u and J d 3 x^/qp HL = 0. Note that this latter condition does not 
require p HL to vanish at all points in space, and one might wish to identify T^ V L with a 
pressureless fluid moving with 4- velocity n M . Taking p HL > in our Hubble patch, we 
may associate this integration constant with dark matter [924] . 

This scenario has been criticised in |167j . where it is argued that the cosmological 
fluid T^ V L will inevitably lead to the formation of caustics and the break down of the the- 
ory. To see why this might be the case it is convenient to go to the Stuckelberg picture, 
where we identify the unit normal with the space-time gradient = ^ u<j)/ \f— (V<^) 2 . 
Now T^, L behaves like a pressureless fluid, and in General Relativity it is well known 
that this will lead to the formation of caustics, due to the attractive nature of grav- 
ity. This is not a problem for real dust, as virialisation can occur. However, in the 
scenario of [924], the fluid is characterised by the gradient V^cj) which is problematic as 
the Stuckelberg field is not differentiable at the caustic. These conclusions have been 
disputed in |923j where it is argued that as the putative caustic begins to form we enter 
a UV regime and the parameter A runs away from its IR fixed point at A = 1. At A =/= 1 
it is claimed that an extra repulsive force could ultimately prevent caustics from forming. 

Perturbation theory - ghosts, tachyons, and strong coupling 

Let us now consider linearised perturbations about a Minkowski background. In what 
follows we will work in units where the emergent speed of light is given by c = 1. Given 
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the residual diffeomorphism, in Eqs. (422 1 to (424 1, we can choose a gauge defined by 



N 



1, 



N l = d i B + n i , and = (1 + 2£)<5y + hi 



(443) 



where n.j is a divergence-free vector, and fry is a transverse-tracefree tensor. To study 
the propagating degrees of freedom we neglect the matter contribution, and integrate out 
the constraints. As it is non-local, the Hamiltonian constraint does not affect the local 
propagating degrees of freedom. Meanwhile, the momentum constraint yields 



D 



1 

sfA 



c, 



and 



0. 



(444) 



where 



1 - A 



- 3A- 1- (445) 
This will shortly be identified with the scalar speed of sound at low energies. Plugging 
this into the action and expanding to quadratic order one finds |1170| I926j 



S 



1 



8nG 



dtd x 



1 



C + CO, 



<) + i (' 



where Sj nt denotes the interactions and 



h ij O t hi 



Sir, 



(446) 



A- 



A 2 

k 2 



k 4 



Ot = A 



A 2 



A 3 

K UV 



(447) 



where /i and are dimensionless parameters of order one. Here we assume that all 
Lorentz symmetry breaking terms in V4 and V§ depend on roughly the same scale, kjjy < 
1/V87tG- The dispersion relation for the scalar is given by 



k 2 



k 2 



k 6 
k^~ 



(448) 



Now we see a problem: At low energies, k <C kuv, w e require c 2 > to avoid a tachy- 
onic instability, where we identify c s with the speed of propagation of the scalar waves. 



However, as we see from the action (446), c s > yields a ghost, which is far more trou- 



bling. We therefore take c 2 < 0, but with \c s \ being small so as to render the tachyonic 
instability mild. But how small does \c s \ need to be? The timescale of this instability 
is t s ~ l/|c s |fe > l/\c s \kuv 1 ancl as this would need to exceed the age of the universe, 
we infer |c s | < Ho/kuv, where Hq is the current Hubble scale |167j . Furthermore, since 
modifications to Newton's law have been tested down to the meV scale, we may im- 
pose kuv > meV. This gives |c s | < 10 -30 , or equivalently |1 — A| < 10~ 60 , on scales 
H < k < kuv- 

What about the interaction terms? At low energies, k -C kjjVi and working up to 
cubic order we find that [7351 [1166 



'S'int. — 



1 

8ttG 



dtd 3 3 



C(^C) 2 - -.dttd^dtC 

ct A 



1 



did, 



dtC 
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aw 



(449) 



To estimate the strength of these interactions we canonically normalise the quadratic 
term ( 446 ) in the infra-red by redefining 



and C = V^G\c s \L 



(450) 



so that 




2 



3 
+ 2 



c ( f^-c 



(2c 2 s + 1) ((diCf 



(451) 



For small |c s |, we see that the largest cubic interactions become strongly coupled at a scale 
A sc ~ yjcTpTS^G- Imposing the constraint, |c s | < 10~ 30 and taking l/\/8nG — 10 18 
GeV, we find A sc < 10~ 18 eV. This lies well below scale of the UV corrections given 
by kjjv > meV so we can certainly trust the effective low energy description we have 
used to derive this scale. The implications for the theory are profound. The scale 
A sc represents the scale at which perturbative quantum field theory breaks down in 
Minkowski space. For scattering processes above A sc < 10~ 18 eV we must sum up the 
contribution from all multi-loop diagrams. Since the claims of renormalisability are based 
on the validity of the perturbative description at all energies, we see that much of the 
motivation for studying this theory is lost. We also note that any notion of Minkowski 
space is meaningless below distances 1/A SC > 10 8 km since one would require a scattering 
process at energies above A sc to probe its structure. In analogy with DGP gravity 



(see section 5.5) one might hope to raise the scale of strong coupling by considering 



fluctuations on curved backgrounds, for example, on the background gravitational field 
generated by the Sun [926 . However, this seems optimistic since Minkowski space is 
an excellent approximation^ 8 ] to the background geometry at distances of order 1 / A sc > 
10 8 km, so the derived scale, A sc ~ ^/|c s | 3 /87rG should still be reliable. 

4-3.2. The non-pro jectable theory 

We now consider the non-projectable theory for which the lapse function can depend 
on space, N = N(x,t), just as in General Relativity. This means the Hamiltonian con- 
straint is now local and that terms depending on = Vi log N could play an interesting 
role in the dynamics 169, 168 . Just as in the projectable case, the absence of full dif- 
feomorphism invariance allows an extra scalar mode to propagate [278J. The action is 
given by 



5 =16^G / 



K 2 



R 



T6ttG 



dtd 3 x v fqNa i a' 1 



+ (1-A) 



16ttG 



J dtd 3 XyfqNK 2 + S m + UV corrections. (452) 



At distances r > 10 8 km from the Sun, the Newtonian potential V(r) < 10 8 . 
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Again we consider small vacuum perturbations on a Minkowski background, working in 
units where c — 1. Given the reduced set of diffeomorphisms, Eqs. (422 1 to (424), we 
cannot gauge away the fluctuations in the lapse function that depend on space. Instead, 
we choose a gauge 



N = 1 



Ni = diB + m, and = (1 + 2£)£ij + h 



(453) 



where, as before, Uj is a divergence free vector, and hij is a transverse-tracefree tensor. 
Now integrating out the momentum constraint, one finds 



B 



1 



C, 



0. 



where c s is given by equation (445), but, as we will see, we do not identify it with the 
speed of sound. The Hamiltonian constraint yields [169| I1025j 

2 



X 



Expanding to quadratic order gives |169l 11025] 



S = 



inG 



dt<F 



c 2 + cac 



S\r. 



(454) 



where S- m * denotes the interactions and 



-A- 



A 2 

k 2 



k 4 



and O t 



A 2 



. A 3 

' k 4 

K uv 
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where ft, and v are dimcnsionless parameters of order one. Again we assume that all 
Lorentz symmetry breaking terms in V4 and V@ depend on roughly the same scale, k 
l/v87rG. We now write down the dispersion relation for the scalar 



uv 



< 



w 



a 



a 



-k 2 + 



k 2 



k 6 
k 4 



(456) 



It follows that the speed of propagation of the scalar waves in the non-projectable theory 
are given by 



St 



The ghost and the tachyon problems can now be avoided by simultaneously taking [1691 
IT0251 

c 2 < 0, and < a < 2. (458) 

Recall that this was not possible in the projectable case, where we had to accept the 
tachyonic instability and use the fact that this should be slow relative to a Hubble time 
to place strong bounds on |1 — A|. Since in the non-projectable case we no longer have 
such concerns regarding tachyonic instabilities, the strongest bounds on |1 — A| and \a\ 
come from preferred frame effects in the Solar System, requiring |1272) 



|i-A|>| <icr 7 . 
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(459) 



For A and a satisfying this bound, the speed of propagation of the scalar is 



, A - 1 
a 

which should not be too slow. If the scalar graviton fluctuations propagated significantly 
slower than other fields to which they couple, then those fields would decay into scalar 
gravitons via the Cerenkov process. This would be particularly worrying for light since 
it would result in photon decay and the absence of cosmic rays. To avoid this problem 
we take ~ 1, and so |1 — A| ~ \a\. Solar system constraints on these theories are 
considered further in 643 , 6431 1645] , based on the solutions presented in 684 , and black 
holes are studied in [2261 12271 [232] . 

What about the interactions? Again, focusing on the low energy theory we find 
that the quantum fluctuations on Minkowski space become strongly coupled at the scale 
A sc - y/\X- 1|/8ttG ~ y/\a\/8irG (102511552] . Note that this result can be derived using 
a direct method similar to the one presented for the projectable case in the previous 
section |1025j , or usi ng th e Stuckelberg method in the decoupling limit |692) . Now given 



the bounds in Eq. (459), it follows that the strong coupling scale is A sc < 10 15 GeV. 
If kjjv > A sc we can trust our low energy description, and the derivation of this scale. 
As explained in the projectable case, strong coupling casts serious doubts on the claims 
of renormalisability as these rely on the validity of perturbative quantum field theory. 
However, if kuv < A sc , we cannot trust our derivation of the strong coupling scale, since 
the low energy description would not be valid there |168j . In this scenario, new physics 
that softens the interactions kicks in at k ~ kjjy. This situation is reminiscent of the 
case in string theory where we introduce the string scale just below the Planck scale, 
where strong coupling would otherwise occur. 

Whilst this seems promising there are some issues facing the non-projectable the- 
ory. One of these relates to the formal structure of the theory, and in particular the 
constraint algebra, which is dynamically inconsistent. This manifests itself through the 
lapse function vanishing asymptotically for generic solutions to the constraint equations 
|603| . The asymptotically flat solutions we have just discussed represent a non-generic 
subset of measure zero in the space of all solutions. 

In the quantum version of the theory, its has been claimed that one must take A < 1/3 
in order to have a stable vacuum |1144| . This is incompatible with phenomenological 
requirements for the following reason: We expect there to be 3 fixed points in the renor- 
malisation group flow for A. These are A = 1 (diff invariance), A = 1/3 (conformal 
invariance) and A = oo [619] . Now, at low energies we require |A — 1| < 10~ 7 and 
c 2 s — A=^j- < 0. This suggests that A flows from infinity in the UV to A = 1 in the IR. 

4-3.3. Aspects of Hofava-Lifschitz cosmology 

HL gravity was first applied to cosmology in [698, 235]. As we have seen, in the UV, 
the relevant degrees of freedom have an anisotropic dispersion relation w 2 rs k 6 /kfjy. 
This is often at the root of much of the interesting cosmology that has subsequently 
arisen, including (i) a scale invariant spectrum of cosmological perturbations, without 
early time acceleration [PJ5J HE3], ( u ) cosmological bounces \TM H3H HE2 1551] ■ 
(iii) dark matter as an integration constant [924 , (iv) chirality of primordial gravity 
waves |1196] and (v) enhancement of baryon asymmetry, abundance of gravity waves, 
dark matter, and so on [927J. These latter effects occur because the modified dispersion 
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relation results in radiation scaling like 1/a 6 as opposed to 1/a 4 in the UV regime. We 
refer the reader to the following review articles on this subject [9261 11095] . 

For an FLRW universe, the background cosmology in both the projectable and non- 
projectable theories is qualitatively very similar. Choosing units where the emergent 
speed of light is c = 1, the Friedmann equation takes the following form: 

3A- 1 2 _ 8ttG ( _ i C(t) 



-H 2 = — [p+^)+V(a) 1 (460) 

where 



V(a) = ^ 



1 



(461) 



and where m and n are dimensionless parameters of order one. Here k = 0,±1 is the 
spatial curvature. We assume that the matter component with energy density, p, and 
pressure, P, satisfies the usual energy-conservation law, p + 3H(p + P) = 0, although 
this is not necessarily required in HL gravity, as we have already discussed. 

The contribution from C(t) / a 3 depends on the theory in question. For the projectable 
theory it corresponds to the "dark matter integration constant" 924 , with C(t) — > 
constant at low energies. For the non-projectable theory there is no such contribution 
and C(t) = 0. 

We immediately notice that the effective Newton's constant seen by cosmology differs 
from the one derived by comparing the low energy effective action to the Einstein-Hilbert 
action: 

2 

G cosmo — — ^G. (462) 

Although as A — > 1, at low energies, we see that G cosmo — > G. 

To see how HL cosmology can admit a bounce, consider the limiting behaviour of the 



right hand side of Eq. (460). Neglecting C(t)/a 3 and assuming p scales like 1/a 3 or 1/a 4 
we see that this goes like ~ nk^y («/a 2 ) , which is negative if nn < 0. By continuity 



this suggests that there exists a* for which the right-hand side of Eq. ( 460 ) is zero at 
a = a*. This corresponds to the position of the bounce, since at this point H = 0. 

Observational constraints on |A— 1| coming from the background cosmology have been 
studied in [4151 |4"4"5] using BAO+CMB+SNla, but they are not particularly strong. 
At la confidence level they find that |A — 1| < 0.02, which is far weaker than the 
bounds presented in previous sections. Recall that in the projectable theory, stability 
considerations require |A — 1| < 10 -60 , whereas in the non-projectable theory preferred 
frame effects require |A — 1| < 10 7 . 

Cosmological perturbations in HL gravity have also been considered (see, for example, 
[g^l^S^[3^rT^^[7T^n^^r7T51fT^5^l[^B^l[^5^j V Indeed, for the projectable theory, it 
has been claimed that scalar fluctuations on cosmological backgrounds are stable. This is 
in contrast to the corresponding fluctuations on Minkowski space 632 which are known 
to suffer from either a ghost or tachyonic instability, as we saw in section [4.3. 1| Whilst 
this may be relevant to long wavelength modes, it is of no consequence on sub-horizon 
scales where we can trust the perturbative analysis about Minkowski space, to a good 
approximation. Also, gravity waves produced during inflation have been found to be 
chiral in HL gravity, thereby representing a robust prediction of the theory |1196j . 
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4.34- The 6 CDM model 

HL gravity represents the UV completion of an interesting cosmological model, dubbed 
0CDM [171] . In this model, it is assumed that the old cosmological constant problem 
is solved in some way, such that the net contribution to the cosmological constant is 
vanishing. The model then seeks to explain the tiny, but non-zero, amount of cosmic 
acceleration that is currently observed, without any fine tuning. Indeed, it is shown 
that the model allows for a technically natural small contribution to cosmic acceleration, 
without any corrections from other scales in the theory. 

A key assumption corresponds to the fact that Lorentz invariance is broken in the 
gravitational sector. Thus the theory contains a unit time-like vector field which may be 
generic (as in Einstein- Aether theory) or expressed in terms of the gradient of a scalar 
field defining a global time (sometimes called the khronoif*^\ . The proposed acceleration 
mechanism appears generically when we assume the existence of another field, 0, which 
is taken to be invariant under shift transformations. The model is a valid effective field 
theory up to a high cut-off just a few orders of magnitude below the Planck scale, with 
a UV completion offered by HL gravity in the khronon case. 

In the absence of any matter sources (including the cosmological constant) the model 
possesses two solutions corresponding to Minkowski and de Sitter space-times. The 
former solution is unstable and the presence of an arbitrarily small amount of matter 
destroys it. The cosmological evolution of a matter-filled universe is driven to the de 
Sitter attractor, with effective equation of state w = — 1. The value of the effective 
cosmological constant on the de Sitter branch is determined by the lowest dimension 
coupling between the Goldstone field and the khronon. Remarkably, it is technically 
natural to assume this coupling to be small as it is protected from radiative corrections 
by a discrete symmetry. Thus, in the absence of a contribution from the cosmological 
constant, the current value of cosmic acceleration would not present any fine-tuning 
problem. 

Interestingly, the evolution of cosmological perturbations is different in the 0CDM 
and ACDM models. In particular, the growth of linear perturbations is enhanced in 
OCDM as compared to the standard ACDM case. The enhancement is most promi- 
nent at very large scales of order a few gigaparsecs, but extends also to shorter scales. 
Another difference is the appearance of an effective anisotropic stress, resulting in a non- 
trivial gravitational slip at very large scales. In principle, these effects may allow one to 
discriminate between 0CDM and ACDM in the near future. 

4.3.5. HMT-da Silva theory 

We have discussed in Sections |4.3. 1| and |4.3.2| how the original versions of HL gravity 
are plagued with problems at the level of both theory and phenomenology. The root 
of this is the breaking of diffeomorphism invariance and the additional scalar degree of 
freedom that propagates as a result. With this in mind, Hofava and Melby-Thompson 
(HMT) proposed a modified version of the projectable theory possessing an additional 
U(l) symmetry |621j . It is claimed that this extra symmetry removes the troublesome 
scalar degree of freedom, so that one is left with a spin-2 graviton as the only propagating 



The khronon field is naturally identified with the Stuckelberg mode in HL gravity at low energies. 
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mode. The HMT action is given by 



Shmt = — / dtd a x 

«G 



{N [KijK* -K 2 - V( qij , Rij, VfciJy) 



&°{2K l0 + ViVjv) - A(R - 2fi)| , (463) 



where IJ = R 1 ^ — + Slg 1 -', and f2 is a dimensionful coupling constant that can 

run with scale. This constant controls the scalar curvature of spatial slices, and can be 
thought of as a second cosmological constant. As with the original projectable theory, 
we assume N = N(t), with a potential V = —c 2 R+ . . . containing the usual terms up to 
dimension six in order to guarantee the 2 = 3 scaling in the UV. In addition, however, 
HMT theory contains two new fields given by A — A(x,t) and v = v(x,t). These are 
important in extending the symmetry group to U(l) x Diff(A^, J 7 ). 
The action is invariant under Diff(A^, J-), with 

SN = d t (Nf) (464) 

6N l = dt(N* f + C) + C^N* (465) 

%j = fd t qij + C^qij (466) 

5 A = d t {Af)+Cd 3 A (467) 

5v = fdtV + CdjU. (468) 

Note that v transforms as a scalar, whereas A transforms like a spatial scalar and a 
temporal vector. Indeed, A transforms exactly as the lapse function would in a non- 
projectable theory. This is not a coincidence. One can think of A as being the next to 
leading order term in the non-relativistic expansion of the lapse. Of course, one ought 
to ask why we have not included the parameter A in front of the K 2 term in the action, 
as in previous versions of HL gravity. According to [621] , the parameter A is fixed to be 
equal to one by requiring the action to be invariant under a local U(l) symmetry: 

5 A = tp-N^ii) (469) 
5v = -V> (470) 
SN 1 = JVVY- (471) 

It is this symmetry that removes the scalar graviton. Furthermore, fixing A = 1 ensures 
no conflict with observational tests of Lorentz violation at low energies. The HMT model 
has been applied to cosmology in |1259j . 

Recently, da Silva has argued that in contrast to the claims of 621 , one can ac- 
commodate A 1 and still retain the Z7(l) X Diff(.M, J-) invariance 353 . Indeed, he 
proposed the following action: 

SdaSilva =^~J dtd3x { N [KijK*' - ^K 2 - \ I>, ,. V/, />',, ) 

+i/8 <J '(2Jfi 3 - + V;VjZ/) + (1 - A)[(Ai/) 2 + 2KAv] - A(R - 2fi)| , (472) 



which is invariant under the same symmetries as (4631. Now we must subject A to the 
same constraints as before, in particular those coming from preferred frame effects. There 

149 



are claims that the extra symmetry will eliminate the scalar graviton even when A ^ 1 
|353) . although more detailed study is required to be sure. Preliminary investigations on 
this subject have been carried out in 631 . In any event, strong coupling problems have 
recently been shown to infect the matter sector of this theory |811j , unless one introduces 
a low scale of Lorentz violation, in a way that is reminiscent of ,1691 1168] . 

4- 4- Galileons 

Galileon theory |963j was originally developed by Nicolis et al. to facilitate a model 
independent analysis of a large class of modified gravity models. In each case, General 
Relativity on perturbed Minkowski space is modified by an additional single scalar field, 
the galileon, with derivative self- interactions. Although the galileon and the graviton 
both couple to matter, any direct coupling between them is neglected to leading order. 
The resulting vacuum Lagrangian is invariant under the following shift in the galileon 
field 

7T ->• TT + b^ + c. (473) 
This symmetry corresponds to a generalisation of Galilean invariance, hence the name. 



The inspiration for the model comes from DGP gravity [454] . In Section 5.5 we will see 
how the boundary effective theory on the DGP brane is well described by the following 
action, 



5 c D ff GP = J<t>x[C GR + C™*\, (474) 



where 



£ - = 16^ {i^ 



<> 2 ( hfj, v - ^hi]^ u ) 



It 
2 



(475) 



£ ° GP = li^il^^-^^^ + h 71 - (476) 



The Lagrangian (474) has two components: a linearised GR piece, £gr, and a modifica- 
tion due to the brane bending mode, C^ GP ■ It is valid in the so-called decoupling limit 
in which all interactions go to zero except the scalar self-interactions. Focusing on the 
7r-Lagrangian, £^ GP , Nicolis et al. observed that the vacuum field equations are built 
exclusively out of second derivatives, d^d^ir. In particular, this means that there are 
no terms higher than second order, ensuring a well defined Cauchy problem and avoid- 
ing any of the potential problems arising from ghosts in higher derivative theories. In 
addition there are no first or zero derivative terms which means that the 7r Lagrangian 
possesses the Galilean symmetry. This is inherited from Poincare invariance in the bulk 
[TUT?] . 

One might expect that almost any co-dimension one braneworld model with large 
distance deviations from GR will be described, in part, and in some appropriate limit, 
by a generalised ir Lagrangian possessing the Galilean symmetry. This essentially follows 
from the fact that the extrinsic curvature of the brane is ss <9 M d„7r, on scales where 
we can neglect background curvature. 

We should also note that even if there is no direct coupling to matter and therefore 
no modification of gravity, galileons are of interest in their own right as a source of 
energy-momentum. In particular, one can potentially obtain violations of the null energy 
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condition without introducing any instability 9641 1343] . Genetically, however, a single 
galileon will result in superluminality, although the situation may be improved by going 



to multi-galileon theory (see section 4.4.5 ) 



4-. 4-1. Galileon modification of gravity 

To see how to generalise the decoupling limit of DGP to a larger class of modified 
gravity theories, let us consider the amplitude, A, for the exchange of one graviton 
between two conserved sources, T M „ and T' . In General Relativity, this amplitude is 
given by 

16ttG " 

A G r 



P 2 



(477) 



where T — Tff. We are interested in the case where gravity is modified by an additional 
scalar, so that locally we have 

5A = A-Agr= \tT'. (478) 

ap z 

Such a theory can be described by the following action 

S = I d 4 x-J—,y^R{g) - C *-nd 2 TT + \ha V T^ + nT + interactions, (479) 
J lonG 2 2 

where g^ = r)^ + h^. The fluctuation is identified with the GR graviton, and 
as such, for a given source and boundary conditions, it coincides with the linearised 
solutions of GR. This statement is true to all orders in the "decoupling" limit 

a T» v 
M ph a, T^ v oo, — ^ = const, and = const, (480) 

M P i M pi 

where M p i — y/l/^nG. Note that matter is minimally coupled to the metric g^ — 
Vfj-v + h^ Vl where the physical graviton is = + 

Galileon Action and Equations of Motion 



Now suppose we consider the decoupling limit ( 480 ) with the additional assumption 
that the strength of some of the scalar self-interactions can be held fixed. This amounts 
to neglecting the back-reaction of the scalar onto the geometry so that we can consider 
it as a field on Minkowski space. We retain some of the scalar self-interactions for the 
following reason: we are interested in an 0(1) modification of GR on cosmological scales, 
but we would like this to be screened down to < 0(1O -5 ) on solar system scales. As we 
will see, the derivative self-interactions can help shut down the scalar at short distances 



through Vainshtein screening^ In the decoupling limit, the action is given by 

S[v,7rl = / d A xC GR + C«, (481) 



°See section 



5.5.4 



for a detailed discussion of the Vainshtein mechanism in DGP gravity. 
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where = £ ga i(ir, dir, ddir) + irT represents the generalisation of the 7r-Lagrangian in 
DGP gravity. 

The vacuum part of the generalised 7r-Lagrangian, C ga i(ir, dir, ddir) gives second order 
field equations, and is assumed to be Galilean invariant in the sense that C ga i — > C ga i + 
total derivative, when ir — s- it + b^x* 1 + c. What is the most general Lagrangian with this 
property? The answer is remarkably simple, and in four dimensions is given by [963(1400] 

5 

£ ga i{n, dir, ddir) = ^ c^tt, dir, ddn), (482) 
i=i 

where the a are constants, ancrH 



A = 


TT 




(483) 


C 2 = 






(484) 


c 3 = 






(485) 


U = 




(ddn) 2 ] (dir) 2 


(486) 


c 5 = 




3(d 2 ir)(ddir) 2 + 2(ddir) 3 ] {dir) 2 . 


(487) 



By construction, the variation of each component is built exclusively out of second deriva- 
tives, 




d A xCi(ir, dir, ddir) — £i(ddir), 



where 

Siiddx) = (t- l)%\ . ..S^id^ir) . . . (d^d^ir). 

Specifically, 



£ x = 1 (488) 

£ 2 = d 2 ir (489) 

£ 3 = (d 2 ir) 2 - (ddir) 2 (490) 

£± = (d 2 n f -3d 2 ir(ddir) 2 + 2(ddir f (491) 

£5 = (d 2 ir) 4 ~6(d 2 ir) 2 (ddir) 2 + 8d 2 ir (ddir) 3 + 3[(ddir) 2 } 2 -6(ddir) 4 . (492) 

It follows that the field equations for the galileon model are therefore given by the fol- 
lowing: 

-Wv-^J +••■ = 8irGT^ u , (493) 
5 

Y^d^i (it, dir, ddir) = -T. (494) 
i=i 



31 We define (ddn) n = (d ai d a2 TT)(d a2 d a3 ir) . . . (d an d ai tt). Note that we have presented the simpler 
expressions as suggested by |400l . 
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Equation (493) corresponds to the linearised Einstein equations , and so their solution, 
ha V , corresponds to the standard GR solution for a given source and boundary condi- 
tions. The modification of GR is encoded entirely in the solution of the scalar equation 



of motion, (494) 



Galileon cosmology as a weak field 

The galileon theory has been constructed in terms of a tensor and a scalar propa- 
gating on a Minkowski background. Whilst it is straightforward to understand the weak 
gravitational field in the solar system using this description, it is not clear how one should 
describe cosmology. Fortunately, at distances below the curvature scale any metric is well 
approximated by a local perturbation about Minkowski space. In what follows, local will 
mean local in both space and time, which for cosmological solutions will correspond to 
sub-Hubble distances and sub-Hubble times. 

Let us consider a spatially flat FLRW space-time. If we take our position to be given 
by x = y = and t = r = 0, then for \x\ < H^ 1 and \t\ < H we have |963j 



ds 2 = -dr 2 + a(T) 2 dy 2 



l- l -H 2 \x\ 2 + l -{2H + H 2 )t 2 



(-dt 2 + dx 2 ), (495) 



where the Hubble scale H and its time derivative H are evaluated now. We recognise 
this as a perturbation on Minkowski space in Newtonian gauge, 

ds 2 w -(1 + 2^>)dt 2 + (1 - 2<S>)dx 2 , (496) 



where the Newtonian potentials are 



32 



* = -^H 2 \x\ 2 + ~(2H + H 2 )t 2 , and $ = (497) 

For a given cosmological fluid, the corresponding GR solutions have Hubble parameter 
H GR . Since h^ v agrees with the linearised GR solution, we have h t t = —2^ GRl and 
hij = 2^ GR Sij, where 

V GR = -\h 2 gr \x\ 2 + \{2H GR + H 2 GR )t 2 . (498) 

Now in our modified theory the physical Hubble parameter (associated with h^ v ) differs 
from the corresponding GR value, H ^ H GR . Since h^ v = h^ v + 2wr] iiv , we have a 
non-trivial scalar 

7T = # - ^ GR . (499) 

Note that a Galilean transformation 7r — > ir + b fl x fl + c merely corresponds to a coordinate 
transformation i M -> x' 1 - cx^ + ~(x v x l 'b ,i — 2b„x v x tl ) in the physical metric. 



32 It may look like that since $ — >t = 2$ 7^ that this construction introduces anisotropic stress. 
This is not the case, however, as strictly speaking the condition for the absence of anisotropic stress is 
D l — Vl 1 ) = 0. It is easy to show that D' j\x\ 2 = 0, hence, no anisotropic stress is present. 
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Self- accelerating solutions 



Of particular interest are self-accelerating solutions. A self-accelerating vacuum is 
one that accelerates even in the absence of any sources for the fields h^ u and n. These 



are familiar to us from DGP gravity (see Section 5.5 1, where the self-accelerating solution 
is haunted by ghosts. Is the same true in a general galileon scenario? Or can we identify 
a scenario that admits a consistent self-accelerating solution? 

There is some ambiguity as to what is actually meant by 'self-acceleration' if the 
tadpole term J d 4 xc\~K is present in the galileon Lagrangian. The point is that at the 
level of the graviton equations of motion, the source corresponds to the vacuum energy, A. 
However, at the level of the scalar equations of motion the tadpole term, J d A x c\~k % has 
the effect of renormalising the vacuum energy seen by the tt field, A — \ A + c\. Indeed, 
in a braneworld context, one might associate the tadpole with the vacuum energy in 
the bulk. To avoid considering a simple cosmological constant, we set the bare vacuum 
energy A = 0, and require the 7r-tadpole term to vanish, c\ = 0. This guarantees that 
Minkowski space is a solution for the physical metric since the field equations are solved 
by h^v = 0, and tt — 0. Note that this Minkowski solution need not be stable, as 
in the ghost condensate scenario. On the contrary, our interest is in stable de Sitter 
solutions. Given the constraints A = 0, and c\ = 0, any de Sitter solution is necessarily 
self-accelerating. 

We now consider maximally symmetric vacua in the absence of vacuum energy, A = 0, 
and the tadpole, c\ = 0. The corresponding GR solution is always Minkowski space, 
with hn V = 0. Non-trivial solutions for the scalar tt = ft(x), however, could give rise 
to self-acceleration. A self- accelerating vacuum with de Sitter curvature H 2 would have 



tt = — jH 2 x^x^. Plugging this into the field equations (494), with c\ — 0, and T = 0, 
gives [963] 

- 2c 2 H 2 + 3c 3 # 4 - 3c 4 H 6 + '^c 5 H s = 0. (500) 

Clearly non-trivial solutions exist for suitable choices of the parameters Cj, so self- 
accelerating solutions also exist. 

Are these vacua consistent? To investigate this we need to consider fluctuations h^, 
and 5tt = tt — tt about the self-accelerating vacuum. Because of the Galilean symmetry, 
the galileon structure is preserved in the effective theory describing fluctuations, 

S huv,STT = d 4 xC GR + £3^, 



where Cs-k = 2i=i di£i(5Tr, 35tt, ddSTr) + SttT. The coefficients can be obtained from the 
coefficients in the underlying theory via a linear map d\ = MijCj, where the matrix 
My depends on the background curvature, H 2 |963j . 

There are two immediate things to consider: (i) does the spectrum of fluctuations 
contain a ghost, and (ii) does the scalar get screened on solar system scales? For a 
general galileon theory, to avoid the ghost we must choose parameters such that di > 0. 
In DGP gravity, where the ghost is known to be present on the self-accelerating branch, 



one would find (S? GP < (see Section 5.5.3 for further details). 



In order to screen the scalar at solar system scales one must appeal to the Vainshtcin 
mechanism. Again, we discuss the Vainshtein mechanism in detail in the context of DGP 

154 



gravity in Section |5.5.4| The mechanism works in exactly the same way in a general 
galileon theory. For simplicity we assume spherical symmetry for fluctuations on the 
self-accelerating background and consider the profile outside of a heavy non-relativistic 
source, T^ v = diag(p(r), 0, 0, 0). Now, it is well known that the GR solution is given by 
the standard Newtonian potential 

lV(r)| ~ ™, (501) 

where the mass of the source M = J p(r)dV. The galileon solution 5ir(r) is given by 
[55511225] 

2d 3 ( — ] +2d 4 ( — ) =— 3 . (502) 



r I \ r J \ r 



Note that £5 is identically zero when evaluated on a spherically symmetric field. At large 



distances one can neglect the higher order terms in Eq. ( 502 ) and derive the linearised 
solution 

Sit Hn (r) = -~r~j—- (503) 
Now |/i M1/ (r)| ~ \Sn hn (r)\ so we have an 0(1) modification of GR. At shorter distances, 



the non-linear terms in Equation (502 1 become important and start to dominate. This 



happens at the so-called 'Vainshtein radius', given by [223 



r v 



Depending on which of the non-linear terms dominates, the profile of the galileon field 
changes to 




M 2 r- 

y/r if the term with d$ dominates 
r if the term with e?4 dominates 

For a suitable choice of parameters one can have |ft MV (r)| 3> |5tt™ m (r)| on solar system 
scales, and might claim that the modification of gravity does indeed get screened. How- 
ever, it is important to note that the Vainshtein mechanism itself has yet to be properly 
understood in a well defined and fully covariant theory. We discuss some aspects of this 
at the end of Section f5. 5. 41 

Nonetheless, our galileon analysis indicates that self-accelerating solutions that are 
ghost-free and exhibit some form of Vainshtein screening on solar system scales could 
exist. However, there are also other concerns. Firstly, we should consider the question 
of back-reaction. Our galileon description holds provided we can neglect the effect of the 
scalar field back onto the geometry. This turns out not to be problematic provided we 
take \ck\ < M^/H 21 - 4 - |5S3] . 

More serious concerns appear when we study fluctuations about the spherically sym- 
metric solutions we have just described. These can cause problems at both the classical 
and the quantum levels. At the quantum level, one must identify the scale at which 
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the quantum fluctuations become strongly coupled, and the radius at which one can no 
longer trust the classical background. As the background solution changes with scale, so 
does the strong coupling scale. This means that there exists a critical radius at which 
the quantum effects start to dominate and one can no longer trust the classical solution. 



Aspects of strong coupling in DGP gravity are discussed in Section 5.5.4 Here we note 
that for a general galileon model the critical radius at which the theory enters a quantum 
fog can sometimes be unacceptably larg^j At the classical level, we find that fluctu- 
ations at short distances can sometimes propagate extremely slowly, so much so that a 
huge amount of Cerenkov radiation would be emitted as the earth moves through the 
solar galileon field. Indeed, to simultaneously avoid problems with Cerenkov emission 
and a low scale of strong coupling in a ghost-free theory with self-accelerating solutions, 
one must introduce a tadpole. As we have already explained, this could be considered 
undesirable as a tadpole will renormalise the vacuum energy seen by the galileon. 

Yet another problem concerns radial fluctuations at large distances. These can prop- 
agate at superluminal speeds, indicating problems for causalitjj^] This is known to be 
a problem in DGP gravity [9,, 609] and can only be avoided in the general case by elimi- 
nating all of the interaction terms. This is unacceptable since the interaction terms are 
crucial to the successful implementation of the Vainshtein mechanism. 

In summary then, while it is possible to obtain self-acceleration in a general galileon 
model that avoids some of the problems facing DGP gravity, one cannot find a com- 
pletely consistent scenario. However, the situation can be improved by the introduction 



of a second galileon |1018j . as we will discuss in Section 4.4.5 
Conformal galileon 

The conformal galileon is constructed in much the same way as the pure galileon 
we have just described, except now we demand that the relevant vacuum Lagrangian, 
^na" formal ( 7r ' ^ 7r ' ddir), is invariant under the conformal group: 



dilations 
translations 
boosts 



n(x») ->7r(&a;' i )+log& (504) 
ttIV 1 ) -> n(x^ + a») (505) 
tt(^) -> tt(AV) (506) 



special conformal , „, , „ „, , 2 



transformations 



Tr(x^) ->■ tt^ + cf"\x\ z - 2(c • x)x^) -2c- x. (507) 



33 Larger than the Schwarzschild radius of the Sun 

34 Note, however, that it has been suggested that causal paradoxes associated with superluminality do 
not always manifest themselves in theories with non-linear scalar interactions |73| . 
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It turns out that £™f ormal = £» Ci Cf nimTaai , where [963, 



aformal 



£con 
2 



formal 



c 



conformal 



£con 
4 



formal 



d 2 Tr+-{dn) 2 



(97T) 2 



< - r £ 4 - ^e- 27r (97r) 2 {4(<9tt) • (ddvr) • (fljr) 

\2e>2„ i 0^,^2121 



-4(97r) 2 9 2 7r + 3[(57r) 2 ] 2 } 
e-^£ 5 + Se^idnflci + ^[(Ott) 2 ] 3 



(508) 
(509) 

(510) 



(511) 



+ ^(dn) 2 [(dn) 2 d 2 n - (dw) ■ {ddn) • (ftr)] 



(512) 



Aspects of the conformal galileon model are studied in [3431 1964J . Violations of the null 
energy condition here can drive inflationary expansion without introducing instabilities. 
There are, however, some issues with superluminality. 

A supersymmetric version of the conformal galileon has been obtained in [687J as a 
consistent completion of the supersymmetric ghost condensate. 

4-4-£- Covariant galileon 



The galileon action (481) describes fields propagating on a Minkowski background, 



and does not represent a fully covariant theory. Although galileon theory was originally 
motivated by co-dimension one braneworld models, it is interesting to consider the four 
dimensional covariant completion of the theory in its own right. This has been worked 
out in [400J 1396) , and is given by 



-fJ 



16ttG 



p i rcov 
H + L gal 



~\~ $ matter [9 (J>vi ^Pi 



(513) 



Naturally we recognise the first term in brackets as the standard Einstein-Hilbert action, 
J d^x^f—gR. The last term corresponds to the matter action. Note that the matter 
fields are minimally coupled to the metric g^ v — f('x)g llv , where the conformal factor 
depends on tt. An obvious example would be g^ — e 27! g^ v although this is by no means 
a unique choice. Neglecting the tadpole, the covariant completion of L ga i = Xli c i^i * s 



5 Our sign conventions agree with (963] . rather than 389 
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given by C% = £ ( aCf 00 , wher^ 



^2 — 






(514) 


pcov 

L, 3 — 






(515) 


/•cot? 

i-4 — 


1 

~2 


(□tt) 2 - (VVtt) 2 - ^i?(Vvr) 2 




(516) 


/"COV 

^5 — 


1 

~2 


(□tt) 3 - 3(D7r)(VV7r) 2 + 2(VVtt) 3 








-6<v(V'V)(v I/ v ce 7r)v Q 


tt) (Vtt) 2 . 


(517) 



Note that for the 4th and 5th order terms one must introduce some non-minimal gravi- 
tational coupling to tt. This is necessary since the naive covariant completion of £4 and 
£5, with minimal couplings, results in equations of motion containing higher derivatives. 
The non-minimal coupling helps to eliminate those higher derivatives. Now although 
the field equations in our covariant theory remain at most second order in derivatives, 
Galilean invariance is broken. We will not present the field equations here since they are 
long and complicated, especially for the higher-order terms. The interested reader can 
find them in [400] , but should be mindful of the fact that the formulae for Cf v and £§ ov 
presented here differ from those in |400j by an overall factor of 4 and 5, respectively. 

In a very recent paper, covariant galileon terms are seen to arise in Kaluza-Klein com- 
pactifications of Lovelock actions |1242j . This might have been expected since the under- 
lying theory has at most second-order fields, and this is inherited by the dimensionally 
reduced theory. We discuss aspects of Lovelock gravity, and in particular, Gauss-Bonnet 
gravity, in Section |5~7| 

4-4- 3- DBI galileon 

The galileon Lagrangian C ga i can also be obtained from the non-relativistic limit 
of a probe brane in five dimensional Minkowski space [389J . The probe brane action 
corresponds to a generalisation of the DBI action, as we will now explain. We take our 
bulk coordinates to be and place the probe brane at y = tt(x). The induced 

metric on the brane is then given by g^ v = r\^ v + d^Trd v TT, from which we deduce that 
the DBI action is 

Sdbi = -A J d A x^g = J d A x- Xy/l + (<9tt) 2 . (518) 

For a slowly moving brane {dir) 2 <C 1, the leading order dynamical piece goes like 
— ^(dn) 2 . To generalise this, we first consider objects that transform covariant on the 
brane, and then build a Lagrangian from them that gives rises to field equations that are 
at most second-order. The relevant covariant objects are the extrinsic curvature, K^, 



36 We define (VVtt)™ = (V Q1 V a ^ir)(V a2 V Q 3vr) . . . (V Qa „ V q itt), Dtt = g^V^Vvir, and (Vtt) 2 
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the induced curvature, Ru, va pi and the covariant derivatives of these quantities. The 
generalised DBI action required to guarantee second-order field equations is 

Sgon-DBI = }^CjSj, (519) 



whercFH 



S2 = — I d 4 x^/^g 

d A x^l + (dn) 2 
S, = I ,/'.,- x tjK 



d A x-i [d 2 n - -f 2 (dn) • (ddn) • (dn)] 



and 



Sa = - 



d A Xy 



d x^f 



J d 4 x^g( J -2G^K^) 



(d 2 n) 2 - (ddn) 2 

+2 7 2 ((dn) • (ddn) 2 ■ (dn) - d 2 n(dn) ■ (ddn) ■ (dn)) 
3 

2„ 

- J d 4 xj 2 [(d 2 n) 3 + 2(9c>7r) 3 - 3(d 2 n)(ddn) 2 

+6"f 2 ((d 2 n)(dn) ■ (ddn) 2 ■ (dn) - (dn) ■ (ddn) 3 ■ (dn)) 
-3j 2 ((d 2 n) 2 - (ddn) 2 )(dn) ■ (ddn) ■ (dn)} , 



(520) 
(521) 
(522) 
(523) 

(524) 

(525) 
(526) 



(527) 



with the Lorentz factor 7 = + (dn) 2 . The expressions for S3 and S5 can be 

identified with the boundary terms in General Relativity and in Gauss-Bonnet gravity, 
respectively. Of course, the former is the Gibbons-Hawking term, and the latter is the 



Myers boundary term |931j . discussed in more detail in Section [5.7.3 Now, for a slowly 
moving probe in Minkowski space, it can be shown that Si ~ f dFxCi, which means that, 
neglecting the tadpole, Sgen-DBi ~ / d A xC ga i [389J . One can then recover the conformal 
galileon by considering a probe brane in AdS, and the covariant galileon by considering 
a general bulk geometry |389j . 

This procedure has recently been extended to probe branes that are curved, giving 
rise to a more general class of effective theories on curved space |545l I544j (see, also 
|221) h These represent the analogues of galileons and DBI theories living on dS± and 
AdS^, retaining the same number of symmetries as their flat space counterparts. There is 
a rich structure and in some cases the symmetries can even admit non-trivial potentials 
beyond the usual tadpoles. 



'We write (d^n)(d t , 1 d^n) . . . (a M „9 M "+ 1 7r)(9 Mre+1 7r) = (dn) ■ (ddn) n ■ (dn). 
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4-4-4- Galileon cosmology 

Galileon cosmology encompasses much more than the original model and its covariant 
completion. The cosmological behaviour of a number of models that are inspired by the 
galileon have also been investigated (see, for example, [11451 17111 17041 13751 17141 1404) ). 
These include the braiding model |714L 14041 11046] , which is described by the following 
action 

L - 1 R + K{4>,X) + G((j),X)n<t> 



S = / d 4 aV- 



16vrG 



(528) 



where X = — (V</>) 2 . Note that for K = %j-X and G = %X we recover the covariant 
galileon action up to cubic order. This model still gives rise to second-order field equations 
and admits some rich phenomenology. It is claimed that the scalar equation of state can 
cross the phantom divide without introducing any instabilities, and results in a blue 
tilt for the spectrum tensor perturbations. Constraints on the model coming from large 
scale structure and non-Gaussianity have been obtained in [693 , 903 , respectively. Non- 
Gaussianity in DBI galileon inflation has been studied in [1059 . An even more general 
class of scalar tensor theories yielding second order field equations has recently been 
presented in |402j . and is now known to be equivalent to Horndeski's general theory [623] 
in four dimensions [710] . 

It has been argued that some of these generalised models are perhaps too general 



|222) . The point is that there is no symmetry protecting the theory from large radiative 
corrections. This can spoil the functional form of the Lagrangian so much so that we 
require more input parameters than we can measure, and we lose all predictivity. In 
contrast, the pure galileon, conformal galileon and DBI galileon theories are safe against 
radiative corrections since they possess additional symmetries that control the form of the 
derivative interactions. For this reason, in the remainder of this section we will restrict 
attention to those models for which the Galilean invariance is only weakly broken, so 
that any radiative corrections that break the galileon symmetry are suppressed. 

Let us begin with early universe cosmology and the covariant galileon. This can give 
rise to inflation even in the absence of a potential |222j . The theory is radiatively safe 
because the terms that break Galilean symmetry are suppressed by powers of A/M p i, 
where A is the naive cut-off^] As with DBI inflation, fluctuations about the quasi de 
Sitter background will result in large non-Gaussianities at low sound speeds )903| 1222] , 
It has been argued that what sets this model apart is the fact that the non-Gaussianity is 
not constrained to obey Jnl ~ V c s) making it distinguishable from DBI inflation |222j . 
although this claim has been disputed in 341 . Indeed, it is interesting to note that the 
authors of 341 adopt the effective field theory approach to inflation, imposing Galilean 
symmetry on the small perturbations around the inflationary background. This permits 
additional interactions compared with [222] . but maintains stability against radiative 
corrections. They find that one can have large (observable) four point functions even 
when the three point function is small. 

We now turn to the cosmology of the late universe. The late time cosmology of the 
covariant galileon has been studied for up to cubic |299| 191 1J . quartic |520[ [23] . and 
quintic scalar interactions |3731 1956] 1369] . In the latter model, we focus on the role of the 
galileon as a dark energy field - it deviates slightly from the original galileon scenario 



38 This corresponds to the scale of the galileon self-interactions. 
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|963) because the galileon coupling to matter disappears as M p i — > oo. In any event, the 
model is described by the following action 



S = I d A X y 



— —R + + £ m (g^,i(j) 
167rG 



(529) 



where 



^ - |(V0) 2 + ^(V0) 2 D0 + ^(V0) 2 



2(D0) 2 - 2(VV0) 2 - l -R{V& 2 



+ ^( v <^) 2 [W 3 " 3(D^(VV^) 2 + 2(VV</>) 3 - eG^V^XVV^V^] . (530) 

As there is no potential, late time acceleration must be driven by the kinetic terms. 
There is a late time de Sitter solution characterised by H = Hds =constant and <f> = 
<f>ds ^constant. The existence of this fixes a relationship between 02,03,04 and C5 such 
that there are only two free parameters, given by 

a = c 4 x dS , and /? = c 5 x 5 dS , 



where Xds = V 87rG yfids I Hdsj ■ In [373| conditions are derived that guarantee the 
absence of ghosts and imaginary sounds speeds in both the tensor and the scalar sector. 
The viable region of parameter space, (a, j3), where these conditions are met is presented. 
As regards the cosmological evolution, we see that there exists a tracker solution that 
approaches the late time de Sitter attractor. In Figure [6j we plot the evolution of the 
galileon equation of state for the tracker solution, and for generic initial conditions. 
Note that the tracker has a phantom equation of state. Indeed, even for generic initial 
conditions, the galileon field is drawn into a phantom phase by the tracker. It turns 
out that the tracker solution is disfavoured by a combined data analysis (SNe, BAO, 
CMB). The generic case fares rather better, especially if we have non-zero curvature, f2fe, 
although it is still disfavoured with respect to ACDM [956 . 

Matter density perturbations have been studied in detail within the context of this 
model in [369] , where it is shown that the growth rate of matter perturbations is larger 
than in ACDM. In the generic case, for suitable choices of a and f3, we typically find that 
the growth index today is 70 < 0.4, with large variations at earlier times. This makes the 
model easily distinguishable from ACDM. Another distinguishing feature is the effective 
gravitational potential changing with time, even during matter domination. 

4-4-5- Multi-galileons 

The extension of the galileon scenario to include multiple scalar fields [10171 110181 
I1019L I396| and even arbitrary p-forms |397j has recently been developed (see [4811 480 
for earlier work). A general multi-galileon theory, in four dimensions, with N real scalar 
degrees of freedom is given by the Lagrangian [10191 11310 

5 

£ N - g ai = E all '" lm Ki::Z] ^v^-v™^. (531) 

m— 1 

where {a ll ' lm } are free parameters of the theory and — m\8 iJ ' 1 ^...8 iJ " m iJ As 

usual, summation over repeated Lorentz (Greek) and galileon indices (Latin) should be 
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Figure 6: Taken from Figure 1 in 369 . The equation of state of the galileon field, wde, versus redshift, 
z, for a = 1.37 and f) = 0.44. The evolution is given for generic initial conditions (A) and for the tracker 
(B). 



understood to be implicit. Note further that we define the m = 1 term of expression 
(531) to be a ll n il . The Lagrangian (531) is constructed so that it is invariant under 



TTi -> n t + (h)^ + a, i=l,...,N. (532) 

One might expect this to appear in the decoupling limit of some co-dimension N braneworld 
scenarios, with 7Ti , . . . , ttn corresponding to the position of the brane in the N transverse 



directions |1017| . Indeed, one can generalise the formalism discussed in Section 4.4.3 
to probe a brane of co-dimension TV and recover the multi-galileon theory in the non- 
relativistic limit [610] . 

How many free parameters are there in this theory? We can always choose a 11 "'* m to 
be symmetric so the total number free parameters is given by 

A /jV + m-l\ f (JV + m-1)! 

^ I m ) - 2^ m\(N - 1)! " ^ j 

m— 1 v ' rn — 1 

Even for N = 3, this corresponds to 55 free parameters. To reduce the number of pa- 
rameters one can consider imposing internal symmetries on the galileon fields |1019j . 



although this will break the galileon symmetry ( 532 ) . The phenomenology of spherically 
symmetric solutions with an internal SO(N) has been studied and found to suffer from 
problems with instabilities and supcrluminality, at least for standard non-derivative mat- 
ter coupling [SD]. One can also prove a generalised form a Goldstone's theorem when 
internal symmetries are present [1310] , 
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Of course, the large number of free parameters is less of an issue in the simple case 
of N = 2, dubbed bigalileon theory. The phenomenology of this theory was developed 
in detail in |1018) . Let us summarise the main results. In direct analogy with the 
single galileon case, the bigalileon theory is formulated on sub-horizon scales as fields 
propagating on Minkowski space. This time we have our GR graviton, h^v, and two 
scalar galileons, 7r and £. Only one of the scalars, 7T, say, couples directly to the trace 
of the energy-momentum tensor. The other scalar, £, couples indirectly through its 
mixing with tt, so it still has an important role to play. The governing action is given by 
[TUTTlfTOTH] 



d 4 xC 



GR 



6 



where Cqr is given by Eq. ( |475[ ), and 



(534) 



(535) 



with 



£ m , n = (m + n)W£ ...8%8»... S^Jd^n) . . . 

■ ■ ■ (d„ m d v ™n) (d Pl d^O . . . (d Pn d^) . (536) 

The physical metric is given by g^ v = r\ pv + h^, where — + Given a 

source T pui /i M „ gives the usual perturbative GR solution, and so litr}^ gives the modified 
gravity correction. The field equations for the scalars are 

T + ^ ' o, m ^ n £ m , n = 0, and S ' b m ^ n £ m , n = 0, (537) 

wherej^Ja™^ = (m + l)(a m ,„ + /3 m +i,n-i) an d b m>n = (n + l)(/3 m .„ + a m _i, n+ i). 

In contrast to the single galileon case, self-accelerating solutions can be consistent in 
some bigalileon theories. Indeed, one can choose parameters such that we simultaneously 
satisfy each of the following: (i) there is no tadpole, (ii) there is a self-accelerating vac- 
uum, (iii) fluctuations about the self-accelerating vacuum do not contain a ghost, (iv) 
spherically symmetric excitations about the self-accelerating vacuum undergo Vainshtein 
screening in the solar system, (v) fluctuations about the spherically symmetric solutions 
are never superluminal, (vi) fluctuations about the spherically symmetric solutions never 
lead to trouble with excessive emission of Cerenkov radiation, (vii) there is not an unac- 
ceptably low momentum scale for strong coupling on the spherically symmetric solution, 
and (viii) there are no problems with back-reaction on the spherically symmetric solution 
(or the vacuum). This supports the case for considering bigalileon theories as a viable 
alternative to dark energy. 

One can also develop models of self-tuning in bigalileon theory, where the vacuum en- 
ergy does not affect the four dimensional curvature. These models get around Weinberg's 
no-go theorem by breaking Poincare invariance. Unfortunately, in order for them to re- 
main compatible with solar system tests one must limit the amount of vacuum energy 
to be < meV. 



39 We define a_i, n = /3 m ,_i = 0. 
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4-5. Other Theories 

Let us now consider some further theories that have yet to be discussed. These 
are ghost condensate theories, non-metric theories, and the dark energy from curvature 
corrections approach of Piazza. 

4-5.1. Ghost condensates 

Ghost condensate theories involve introducing into the gravitational sector an extra 
scalar field, <j), with shift symmetry 

4> — > 4> + constant. (538) 

The only terms in the action that can obey this symmetry are derivative ones, and so 
the building block for this theory is taken to be 

X = d^4>. (539) 

In [590] it was shown that if the leading order term in the action has the wrong sign, 
so that 4> is a ghost field, it is still possible to construct a theory that is stable to small 
fluctuations by including terms that push X to a fixed value, so that 

(X) = C. (540) 

Theories of this type have a number of interesting properties. For a start, the non-zero 
vacuum expectation value of the ghost field signals a spontaneous breaking of Lorentz 
invariance. What is more, fluctuations in the ghost field about the vacuum expectation 
value appear linearly in the energy-momentum tensor, meaning that anti-gravity is pos- 
sible. A further interesting point is that in the weak field limit large ghost condensate 
clumps move more slowly than small clumps, with potentially interesting phenomenolog- 
ical consequences. 

It has been argued in |590| that ghost condensate fields act like the gravitational 
counterpart to the Higgs field of the standard model of particle physics. This is because 
gravitational fields propagating through the condensate acquire a massive mode, much 
like particles acquire mass while propagating through the Higgs field. Ghost condensates 
also introduce oscillatory correction to the gravitational potential, with a Jeans instability 
that grows with time. For mass parameters of the order 10 _3 eV, these corrections occur 
on spatial and temporal scales greater than Hq 1 . If the massive modes are of order 
lOMeV, however, then corrections can occur on length scales as small as 1000km, but, 
again, only on time scales greater than Hq . 

The action for ghost condensate theories can be written 

S = J d A x^-g + M 4 P(X)) , (541) 

where M is a mass scale (confined to be lmeV < M < lOMeV), and P(X) is a function 
that must have a non-zero minimum at X — C in order to be a ghost. For stability we 
then require 

P'{C) > (542) 

P'(C)+2CP"(C) > 0. (543) 
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Extremisation of the action (541), with respect to the metric, yields field equations of 
the form 



G M „ = SttGT^, (544) 

where 

T^ v = M 4 (P(X)g^ + 2P / (X)d^d„<P) , (545) 

and for simplicity we have not included a term in the action for normal matter fields. 

To see why this theory is considered a modified theory of gravity we look at the 
perturbed field equations. Writing the metric as = rj^ + h^ v , and the ghost field as 
<j> = yCt + 7r, the Lagrangian density of the theory becomes 

C = 7=7 [fbPO + F^K 2 + F 2 {X)K^K r] +•••], (546) 

where we have chosen a unitary gauge, and where 7r has been set to zero. The F n here 
are functions of X that are derived from P(X), and K is the extrinsic curvature of the 



3 dimensional hyper-surfaces of constant (f>. Diffeomorphism invariance in Eq. ( 546 1 can 
be seen to have been explicitly broken. 

Let us now consider cosmology. For a homogeneous and isotropic Universe, the equa- 
tion of motion for </> is 

j f (a 3 <i>P'(X)) = 0. (547) 



If we assume that X —> and P'(C) — > 0, as t — >• oo, then Eq. (547) tells us that 
<t> — > ±yCt. The Friedmann equation for this theory is 

2 

H 2 = ^- (2XP'(X) - P(X)) , (548) 

where the new mass parameter is m = V SwGM 2 = M 2 /Mpi. The Raychaudhuri equa- 
tion is 

2 

fl 771 

- = -—{XP'(X) + P(X)). (549) 
a 6 

Now, it can be shown that the effect of the ghost field on the expansion of the Universe 
is such it can mimic radiation domination, matter domination and vacuum domination. 
Indeed, the simple choice of P(X) = \{X — C) 2 leads to a(t) tx [Cmt) 1 ! 2 at early times, 
and a(t) cx (Cmt) 2 / 3 at late times. Adding a constant, such that P(X) = \(X — C) 2 + 
A/m 2 , leads to a period of vacuum domination. 

A general class of solutions, with matter sources included, has been studied in |755j . 
Some of these solutions combine dark matter and dark energy-like behaviour, at the 
background level. The behaviour of scalar perturbations in the ghost condensate theory 
was worked out in detail in [922J . Modified Newtonian potentials were discovered with 

* = + Qrnod (550) 

* = ^ G R + ^rnod, (551) 
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where &gr and ^GR take their standard form from General Relativity, while & m od and 
^mod are corrections due to the ghost condensate that occur in the limit where the 
wavelength of the fluctuation is larger than the symmetry breaking scale. If we consider 
the case of de Sitter space, where <fr m od = ^mod, then the evolution equations are 

„ f a k A aM 2 k 2 2 \ aM 2 k 2 , . 

d^ mod + ZH d t * mod + - + ™l) **od = mi -^GR, (552) 

where a is a combination of dimensionless coefficients of 0(1) from the action. These 
equations shows that the Newtonian part of the potential seeds the modified part. 

4-5.2. Non-metric gravity 

We will now describe the non-metric gravity theory that deforms GR while keeping 
only two dynamical degrees of freedom 136] I137| 11381 I747| I748j . In this theory the 
fundamental gravitational object is no longer the metric but a triple of 2-forms B 1 = 
B* dx 1 * A dx v , where lower-case Latin indices denote internal SU(2) indices and take 
values from 1 to 3. The space-time metric is an emergent variable and is given in terms 
of B\ v as 

y/=gg M u cx e^B^B^B^k (553) 

where l a ^ lS is the completely antisymmetric tensor density having components ±1 in 
any coordinate system. The proportionality symbol is used above, rather than equality, 
because the metric is defined only up to conformal rescalings. The reason for this is that 
B l is self-dual, i.e. \e B l pa = iB l ^ v is a conformally invariant relation. 

The class of theories we will now describe contains only two propagating degrees of 
freedom [139| 513] . just like GR. Spherically symmetric solutions, as well as black holes, 
have been studied |751| I752[ 1648) , and extensions of these ideas to bimetric theories have 
also been considered |1 175] . Let us now describe the kinematical setup of the theory 
before proceeding to discuss its dynamics. 

Kinematics 

Consider the set of 1-forms {e°, e 1 }, the tetrad, where the capital Latin indices denote 
internal SO (3) indices (note that this is a different space than SU(2) considered above) 
such that 

ds 2 = g ilv dx> x dx v = -e°e° + (554) 

From the tetrad we can also define the self-dual 2-forms S/ + o and similarly the anti 
self-dual 2-forms .), by 

^ ± =ie° Ae 1 t^jk^ /\e K . (555) 

Any other self-dual 2-form can then be decomposed in terms of I^fjs- ln particular, we 
may write 

g=B\g {+) . (556) 
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From £T we can then define the connection one-forms A 1 a^ 40 | 
The above equation can be solved to get 



4 = 2^B B>OPSikB ^ VBk ^ (558) 



where det-B = — ^g£ ljk B l v B J V p B k p M . Now, since B 1 is conformally invariant, we know 
that A 1 is too. We can now proceed and define the curvature two-forms F l of A 1 as 



g = d£ + ^e l jk A j AA k . (559) 



Dynamics 



The action for this theory takes on the form of the well known BF-theory: 



5 l] B i A F 3 [A] - \v{B l AB r ) 



S n 



(560) 



where T^(M) is a holomorphic function of a complex symmetric 3x3 matrix, M, that 
is required to be homogeneous of degree one (i.e. V(XM.) = \V(M)) so that when it 
is applied to a four-form such as B l A B 3 the result is also a four-form. We introduce 
the internal metric h 13 = B t I B 3 j6 IJ and further decompose it into trace and traceless 
parts as h ij = \h (<5« + F« ^ 
V{h i3 ) = \hU{H i3 ), and expand U as 



where h = Sijli* 3 and 8ijH VJ — 0. We can then write V as 



C/(H)=A o -^trH 2 + 0(H 3 ), 



(561) 



where the constant Ao plays the role of the cosmological constant, while the constant £ is 
a new scale that describes deviations from GR. The minus sign in the 2nd term above is 
required to avoid instabilities. In particular, as I — > the theory reduces to the Plebanski 



formulation of GR with a cosmological constant (see Section 2.3.31 



As discussed above, the metric here is defined only up to conformal transformations. 
In order to couple the theory to matter fields we have to fix this ambiguity, which can 
be achieved by the introduction of a new function R(h lJ ) that is also homogeneous of 
degree one. The conformal freedom is then fixed by requirinj^] R(h.) = 1. In a similar 
fashion to V, we can then decompose R as R = hhU m (H), and expand U m as 



1 - -trH 2 + C(H 3 ), 



(562) 



40 Strictly speaking this defines a three-form which is then dualised to a one-form. 

41 This method of fixing the conformal ambiguity can be shown to arise naturally by considering the 



motion of a test body |749| . 
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where g is a dimensionless constant that can be of any sign. This new parameter measures 
the departure from the Urbantke metric, given by an equality in Eq. (553). Rather than 
the two new parameters, g and £, it is sometimes convenient to use the two dimensionless 
parameters j3 and 7, defined by 



and 



1 



7 = 



e 2 A 



(563) 



Cosmological consideration then tell us that < g < 1, and hence — | < j3 < | and 
7 > [753] . General Relativity is recovered in the limit 7 — > 00. 

Variation of Eq. ( 560 ) with respect to A gives Eq. ( 557 ) , while variation with respect 
to B gives 



SijB^Fi = B\ 



dU 



1 



AS 11 - 2nGT 



^ + -A & 



f>j yJ 



8nGT u X 



where A and A m are the Legendre transforms of U and U m respectively, i.e. 



A = U 



dU 



and 



A, 



U„ 



OUrn 

dm 



H'- 



=(-)' 

(564) 



(565) 



and where T IJ = T^T, I fiX I] J ^ x , where T^ v is the traceless part of the energy-momentum 
tensor, [75U] . 



Cosmology 

The cosmology of this theory has been analysed by Krasnov and Shtanov at the level 
of perturbed FLRW solutions [753] . Let us first consider the FLRW solutions of this 
theory. For homogeneous and isotropic spaces we have B u — 6 . Hence, we can drop 
the distinction between i and / and let B* = S\ We also have H l i = 0, so U = Aq and 
U m = 1, resulting in R = and V — ^hAo. The fixing condition R = 1 then gives 
h = 3 and V = Aq. Under these conditions the field equations can be written 

£* = Ia ¥ - 2nG(P 2nG{p + P)|\ (566) 

o o 

For the homogeneous and isotropic space-time we also have e° = adr and e l = adx 1 . 
After some algebra this gives A j = iHdx j and £ l = iU'dr A dx j - 2 < ' ,h )i J d.r l A dx k . 
The field equations can then be written 

m 2 = MGa 2 {p + p K ), (567) 

where Pa = — PA with Ao = 8ttGp\, and 

~2H'-H 2 = 87rGa 2 (P + P A ), (568) 

where p and P do not include the cosmological constant. Thus, for metric backgrounds 
the FLRW solutions of this theory are the same as those of General Relativity. The 
situation changes, however, when we consider linear fluctuations. In this case one gets 
departures from ACDM that depend on g and I. 
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We now consider the perturbed space-time metric in the conformal Newtonian gauge. 
The perturbation for is then given (after some convenient gauge-fixing) in terms of 
a new scalar mode x as 

B i i = S i J + i^D i jX . (569) 
The perturbed field equations are then given by 753 

-fc 2 $ = 4TrGa 2 p [S + 3K(1 + w)0] + -^k 2 \k 2 x - WHjA , (570) 

3ar L J 

= 4nGa 2 {p + P)6- 7^k 2 X ', (571) 

k 2 k 2 
$" + 2U& + WS' + (2H' + U 2 ) *+—($-*) = ^Ga 2 5P + (572) 

$-$ = 8iG ffl 2 (p+P)S+i^"+y X j, (573) 

and one can show that the Bianchi identities arc satisfied independently of the x terms. 
In this sense the x field is non-dynamical. The remaining equations determine x m terms 
of $ and ^ as 

X " - 2Ux' - [A + m 2 + A o7 + 8irGf3(p - 3P)] x + a 2 ($ + *) = 0, (574) 

where p and P does not include the cosmological constant. This equation can be solved 
to get x m terms of $ -I- "J, hence the x terms in the field equations can be thought of as 
non-local modifications of the Einstein equations. 

Krasnov and Shtanov also find the vector and tensor mode equations [753J. Further- 
more, they study the evolution of perturbations during inflation, a well as radiation, 
matter and A dominated epochs, and estimate the effects of the modifications on the 
matter power spectrum. 

4-. 5. 3. Dark energy from curvature corrections 

A proposal for IR modifications of gravity has been put forward by Piazza [1036, 1037j . 
The starting point for this is the usual semi-classical gravity, where matter fields are 
quantised on a curved background manifold. The operators of the matter field theory are 
then modified in the IR in a way we will now describe. Schematically, in a cosmological 
setup, operators corresponding to Fourier modes of physical momentum k are corrected 
by terms of order H 2 /k 2 , where H is the Hubble parameter. These modifications lead 
to the apparent existence of Dark Energy, but without introducing a new scale in the 
problem. 

To illustrate this idea consider the vacuum expectation value of the local energy 
density of a massless fielcp 2 ] 

(T° (t,x)) bare = J d 3 k 

— local terms + non-local terms, 



ZquadW /logW 



k 3 



(575) 



see e.g. 1298, 516 for the explicit expression of a massive scalar on flat FLRW background. 
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where spatial homogeneity has been assumed for simplicity. The local terms can be re- 
moved by local gravitational counter-terms, while the non-local pieces represent the gen- 
uine particle/energy content of the chosen "vacuum" state. The first term contributes to 
the cosmological constant, and in flat space-time can be removed by the usual procedure 
of normal-ordering (the /'s vanish in flat space-time). In curved space-time, however, the 
presence of the time-dependent /'s makes the normal-ordering procedure meaningless. 
The conjecture of [10361 11037j is that there exists a theory that resembles semi-classical 
GR on small scales, but that has an IR-completion that prohibits the time dependent 



pieces in ( 575 1 . If that is the case then one can still deal with the cosmological constant 
term by the usual procedure of normal-ordering, as in flat space-time. 

To try to construct such a theory |1036|, H037J propose what they call the Ultra 
Strong Equivalence Principle: For each matter field or sector sufficiently decoupled from 
all other matter fields, there exists a state (the "vacuum") for which the expectation 
value of the (bare) energy-momentum tensor is the same as in flat space, regardless of 
the configuration of the gravitational field. 

What this principle aims to achieve is to remove the time-dependent terms in Eq. 



( 575 1 by appropriate modifications of semi-classical gravity that manifest themselves 
when the Fourier modes have wavelengths comparable to the inverse extrinsic curvature 
(i.e. the inverse Hubble radius H^ 1 ). At the present, a complete theory that imple- 
ments this idea is lacking, but a toy-model with massive scalar fields has been considered 
in |1036[ 11037] . Letting n be the comoving momentum that labels operators in Fourier 
space (related to physical momentum as n/a), the modification to 0(H 2 a 2 /n 2 ) is given 
by the modified commutation relation 



where is the annihilation operator. This prescription is equivalent to using the 



standard commutation relation 



A (o) A m 



n — ft') for the standard operator 

A^2\ but with a modified comoving momentum given by k — ft (l — H 2 ^ j that locally 

defines the infinitesimal translations. In a local neighbourhood (smaller than a Hubble 
patch) the above pres cription can be shown to cancel the quadratically divergent piece 



/quadW in Ec l- < 575 



Note that the momentum k is not conserved, but ft is. 



To extend the above to the global picture one can use the translation operator 
e- iAP<1) , where P« = f d 3 n n A { } ] = P« = f d 3 n k A^ A { ? } is the momentum 
operator constructed with the modified Fourier modes, and A is the comoving proper 
distance to a point far away from the origin. In GR the comoving distance A = d(t)/a(t) 
is a constant given by the ratio of the physical distance, d(t), to the scale factor, a(t). 
However, in the present theory one finds instead that 

A = ^j^H 2 ) + higher orders. (577) 



Comoving distances obeying Eq. (5771 are, in fact, already strongly disfavoured by ob- 
servations 958 . One may, however, try to explore further whether the dynamical Hubble 
scale H(t) itself could provide the scale required by cosmic acceleration by considering 
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the more general expansion 

A = AiXH + A 2 (XH) 2 + . . . 

+B 1 X 2 j t (aH) + B 2 X Z j t {a 2 H 2 ) + ..., (578) 

where Ai and Bi are a set of constants. The authors find that certain regions of the 
resulting parameter space can fit the data as well as ACDM. 
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5. Higher Dimensional Theories of Gravity 

The first systematic studies of higher dimensional geometry date back to the likes 
of Riemann, Cayley and Grassmann in the mid nineteenth century. It lies at the heart 
of General Relativity, where space and time form part of a curved 3 + 1 dimensional 
manifold, as described in Section [2] Of course, Riemannian geometry is not restricted to 
3+1 dimensions, so we have the tools to study gravitational theories in higher dimensions. 
Indeed, this is more than just a theoretical curiosity. Superstring theory, arguably our 
best candidate for a quantum theory of gravity, can only be formulated consistently in 
10 dimensions. 

The problem now is a phenomenological one: Gravity does not behave like a 10 
dimensional force in our experiments and observations. Perhaps the simplest observation 
along these lines is the stability of earth's orbit. In D dimensions of space-time, the 
Newtonian potential due to a point source will typically go like l/r D ~ 3 . For D + 1 4, it 
follows that we cannot have stable planetary orbits, and so it is clear that gravity should 
not appear 10 dimensional on solar system scales. We use the word appear, because there 
exist gravitational models where the extra dimensions are hidden from experiment, but 
which open up at shorter and/or larger distances. 

In this section we will review various models of higher dimensional gravity that have 
been proposed. We will only discuss the case of extra spatial dimensions, although extra 
temporal dimensions have been studied (see eg |1141j ). One might worry that extra 
temporal dimensions lead to problems with causality, as they permit closed time-like 
curves in the form of circles in the plane of the two temporal directions. 

5.1. Kaluza- Klein Theories of Gravity 

Kaluza-Klein (KK) theory grew out of an attempt to unify gravity and electrody- 
namics [9851 [6731 [7011 [702] • The basic idea was to consider General Relativity on a 4 + 1 
dimensional manifold where one of the spatial dimensions was taken to be small and 
compact. One can perform a harmonic expansion of all fields along the extra dimension, 
and compute an effective 3 + 1 dimensional theory by integrating out the heavy modes. 
This idea has been embraced by string theorists who compactify 10 dimensional string 
theories and 11 dimensional supergravity/M-theory on compact manifolds of 6 or 7 di- 
mensions respectively, often switching on fluxes and wrapping branes on the compact 
space (see |552j for a review). Each different compactification gives a different effective 
4-dimensional theory, so much so that we now talk about an entire landscape of effective 
theories [TTM] . 

Assuming that the extra dimensions have been stabilised, the late-time dynamics of 
KK theories is most easily understood at the level of the AD effective theory. As we will 
show, this will generically correspond to a AD gravity theory with extra fields, examples 
of which are studied in detail in Section [3] At early times, when the 3 dimensional space 
is comparable in size to the extra dimensions, the effective description clearly breaks 
down. This forms the basis of KK cosmology where one can ask the deeply profound 
question of why and how the 3 extended dimensions of space were able to grow large, 
while the extra dimensions remained microscopically small. It seems fair to say that a 
fully satisfactory answer to this question has yet to emerge. 
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We will now discuss some aspects of KK theory, starting with an overview of dimen- 
sional reduction and effective theory before moving on to a discussion of KK cosmology 
at early times. For a more detailed review of KK theory see [751 H005 . 



5.1.1. Kaluza- Klein compactifications 

To understand the generic features of KK compactifications, it is sufficient to describe 
the dimensional reduction of General Relativity on a circle, S 1 . We first define General 
Relativity in D = d + 1 dimensions, via the generalised Einstein-Hilbert action 

(579) 

where Gd is Newton's constant in D dimensions, ^ab is the D dimensional metric 
with corresponding Ricci tensor, IZab, arid Ricci scalar, TZ — r y AB lZAB- Note that we 
are neglecting the matter Lagrangian for brevity. We are assuming that one of the 
spatial dimensions is compactified on a circle of radius L/2n. To this end we can define 
coordinates X A = (x^, z), where the coordinate z lies along the compact direction, such 
that < z < L. 

We can expand the metric as a Fourier series of the form 

1 AB(x,z) = Y / lA n B (xy nz/L - (580) 

n 

We find that this gives an infinite number of extra fields in d dimensions. Modes with 
n =/= correspond to massive fields with mass \n\/ L, whereas the zero mode corresponds 
to a massless field. As we take L to be smaller and smaller we see that the mass of 
the first massive field becomes very large. This means that if we compactify on a small 
enough circle we can truncate to massless modes in the 4-dimensional theory. Massive 
modes will only get excited by scattering processes whose energy lies at or above the 
compactification scale 1/L. This also applies to matter fields arising in particle physics. 
Indeed, particle physics imposes by far the strongest constraints on the size of the extra 
dimension. Standard Model processes have been well tested with great precision down 
to distances of the order ^(TeV)" 1 , with no evidence of extra dimensions yet emerging 
|941) . Assuming that the extra dimensions are universal, that is the Standard Model 
fields can extend all the way into them, we infer that L < 1CP 19 m. The natural scale of 
the compact dimensions is usually taken be Planckian, L ~ l p i. 

Let us now focus on the zero modes, Jab( x )- We could define 7^° and jiz to 
be the d-dimensional fields g )JlV (x), A fJi (x) and 4>{x). In effective field theory language, 
these will correspond to the metric, gauge field, and dilaton, respectively. In order that 
our results are more transparent we will actually define the components of the metric in 
the following way: 

7$ = ^ V + iSS = <P*4* T$ = (581) 



where a = \j \j2{d — — 2), and (3 = —(d — 2)a. Since we have truncated to the 



massless fields, we can integrate out the z part of the action given in Eq. (580). Wc find 
that the c?-dimensional effective action is then given by 

Sefflff, A, <f] = J ddx ^ ( R ~ - \e- 2{d - l)a *F^j , (582) 
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where F 2 — F^ V F^ U and F^ = V^A^ — V„ A M is the electromagnetic field strength. The 
curvature associated with the d dimensional metric, g^ v , is described by the Ricci tensor, 
and Ricci scalar, R = g^R^. What we now have is an Einstein-Maxwell-Dilaton 
system in d dimensions. Of course, Kaluza and Klein were particularly interested in the 
case of d = 4. They were frustrated by the presence of the dilaton, (/>, in the resulting 
4-dimensional effective theory. The point is that one cannot simply set the dilaton to 
zero and retain a non-trivial Maxwell field, since this would be in conflict with the field 



equations arising from Eq. (580), 
1 



, (583) 



V [e-^-^F^ = 0, (584) 
D<f> = ~{d- l)ae- 2(d - 1)a *F 2 , (585) 

where = R^ — -^Rg^ is the Einstein tensor in d dimensions. In the usual jargon, 
switching off the dilaton does not represent a consistent truncation of the higher dimen- 
sional theory [440] . We should also note that the physical size of the compact dimension 
is not necessarily given by L, but by Le^^ x \ If L is to represent an accurate measure 
of the compactification scale, we are therefore implicitly assuming that is stabilised 
close to zero. For this to happen we need to generate a potential for <f> that admits a 
stable solution- this is known as the problem of moduli stabilisation. In more general 
compactifications, moduli potentials can be generated by Casimir effects of fields in the 
compact space [541 1551 |2"3"51 13351 1336j . but the moduli remain unstable [336] . In fact, the 
problem of moduli stabilisation has only recently been solved by switching on fluxes to 
stabilise the volume of the compact space [6621 1661j . 

There are, of course, many different compactifications that have been studied in the 
literature, a detailed analysis of which is clearly beyond the scope of this review (see 
|552) V However, aside from details such as the inclusion of fluxes and branes on the 
compact space, the general scheme of each compactification is the same as the one we 
have just described. Typically, a compactification of, say, 11 dimensional super-gravity 
down to four dimensions will give rise to a gravity theory with a plethora of extra fields. 
These extra fields include scalars, pseudo-scalars, vectors, pseudo-vectors, and arbitrary 
p-forms. Modifications of gravity due to extra fields are studied in detail in Section [3j 



5.1.2. Kaluza- Klein cosmology 

As we have just discussed, for a phenomenologically viable theory with compact (and 
stabilised) extra dimensions, the characteristic size, L, of the compact manifold should 
not exceed the scale probed by modern collider experiments, which is currently around 
10~ 19 m |941) . It is amusing to compare this to the characteristic size of the 3 large spatial 
dimensions, which is at least a Hubble length c/H ~ 10 26 m, or in other words at least 
45 orders of magnitude greater. Of course, it was not always like that. In the very early 
universe, at times t < L/c, one might expect that all spatial dimensions were of the same 
scale, each playing an equally important role in the dynamical evolution. This begs the 
question: Why did the Universe evolve into a state where just 3 spatial dimensions grew to 
macroscopic scales? Put another way, how does one achieve a dynamical compactification 
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mechanism in the early Universe such that 3 spatial dimensions expand exponentially 
to an extremely large size, in contrast to the remaining spatial dimensions? Were those 
extra dimensions somehow prevented from growing beyond a certain size, or did they 
grow initially and later contract towards their current state? 

These questions have led many authors (see, e.g. [514 1 133 ] 1750 1 HUM! H0831 fT085l U| 131 
12) 1997(1555] ) to consider the dynamics of anisotropic cosmologies in D = d+1 dimensions, 
where d = n + n. Indeed, consider the Bianchi-typc metric 



ds 2 



"f ab dX a dX" = -dt 2 + a 2 (t) qij {x)dx l dx 3 + a 2 \t)q mn {x)dx m dx n , 



(586) 



where the coordinates x % run over the n spatial dimensions and the coordinates x m 
run over the n spatial dimensions. The n-dimensional metric, qij(x), is taken to have 
constant curvature k, whereas the n-dimensional metric, q m n{S:), is taken to have constant 
curvature k. The growth of these two spaces is controlled by the relevant scale factors 
a(t) and a(t). Naturally, we will be interested in the case of n = 3, but for the moment 
let us keep things general. 

We now apply Einstein's equations in D = d + 1 dimensions, 



Gab = T^-ab - ^Ti-lab = ^G D T ab , 

where the energy-momentum tensor is given by an anisotropic fluid, 



(587) 



n - diag 



v 




As usual, p(t) is the energy density, whereas pit) is the pressure along t he n dimensions 
and P(t) is the pressure along the n dimensions. Einstein's Equations (587) then yield 
the following [514] 



(588) 

°" 1 ^ " 6 l (589) 

(590) 



2(„-l)(ff' + £) + f(n-l) 
f + {n-l)(H 2 + +fiHH 



d-l 



nHH = 



8-itG d 
d-l 



- nnHH = 8ttGdPi 
p+(h-l)P-nP , 
p- nP + (n- l)P] , 



where H — a/a and H — a/ a are the Hubble parameters of the two expanding/contracting 
spaces. Of course, we also have energy conservation, which gives 



p + nH{p + P)+ nH(p + P) = 0. 



(591) 



Note that we do not necessarily have to assume that the cosmological dynamics is gov- 
erned by D-dimensional General Relativity. We can also consider modifications of GR 
where additional fields are present. For example, in string gas cosmology |183l 11216] , we 
consider the action 



S = 



1 



16ttG, 



d D x, 



^e- 2 ^ (TZ - 4(V0) 2 - c) + S m [ 7 , tf n ], 
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(592) 



where S m is the matter part of the action, containing the string gas, and the constant 
c vanishes in the critical cas^] but not otherwise. The resulting field equations can be 
written in the form G a b = SitGoTab, where 



T - 1 
ab ~ SnG, 



~7»a + 8V a 0V,0 - 6 7Qb (V(/») 2 - 2(V a V 6 - TaftD)^] + e 2 ^ n) , (593) 



and — — ^=|^gj. The scalar equation of motion just follows from energy conser- 
vation, V Q T ab = 0. 



Let us return to Equations (588)- (590) with a view toward dynamical compactifi- 



cation. Many of the earlier works [31 [TU551 HU551 [Q |3J focus on isotropic 

perfect fluids, for which P = P = wp. For simplicity and definiteness, let us follow the 
analysis of Abbott, Barr and Ellis pQ. We consider an epoch in which we have radiation 
domination, w = l/(n + n), so that the evolution equations read 



|(n - 1) (H 2 + + f (n- 1) (ff 2 + f J + nfifffT = 8ttG dP , (594) 
f + (n - 1) (^ 2 + + fifflf - Ssf^p, (595) 
§ + (n- 1) (h 2 + §) +nHH = i ^f s -p. (596) 



Now, if the n dimensions are taken to be an n-sphere (k > 0), it is clear from Equation 



(596) that they will reach a maximum size when H = 0, and will subsequently start to 
recollapse. In contrast, we can take the n dimensions to be flat or hyperbolic (k < 0), so 
that these dimensions will never turn around. In fact, one can show that as we start to 
approach the singularity of the collapsing sub-space (a(t) — > 0), the n dimensions enter 
a phase of accelerated expansion. To see this note that H starts to become large and 



negative, and so it is clear from Equation (595) that we will enter a phase with a > 



The typical evolution of the two scale factors is shown in Figure [7J Note that we can 
even allow for a sufficiently small n > and still retain this qualitative behaviour. The 
upper bound on n follows from demanding that the turnaround in a(t) occurs after the 
turnaround in a(t). The bound is not strong enough to be interesting: It merely implies 
that today's Universe is larger than the horizon p]. 

Of course, it is clear that the classical equations will start to break down in the 
neighbourhood of the singular point. The physical radius of the n-sphere is a/yk, so 
we certainly would not expect to trust our field equations when a(t) < \fkLj) 1 where 
Ljj cx G 1 ^ ^ is the fundamental Planck length in D dimensions. In Q], it is assumed 
that quantum gravity effects will ultimately stabilise the size of the internal space, ending 
the inflationary phase at some time t c , where a(t c ) — VhL, for some compactification 
scale L > Ljj . Albeit without much justification, let us accept this assumption for the 
moment, and consider the physically interesting case of n = 3. One might hope that 
the inflationary phase is sufficiently long to offer a solution to the flatness, entropy and 
horizon problems of the standard cosmology. Consider the entropy problem in particular. 
Entropy is indeed released from the extra dimensions into the usual 3 dimensions of 



43 For the bosonic string the critical dimension is D = 26, whereas for the superstring the critical 
dimension is D = 10. 
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Figure 7: Adapted from Figure 1 in [J. The typical evolution of the scale factors in the two different 
sub-spaces. The scale factor a(t) along the n dimensions grows large, entering an inflationary phase as 
the remaining h dimensions begin to recollapse. The scale factor a(t) along the n dimension is assumed 
to be stabilised by quantum gravity effects at some time t c . 



space [331Q], but only as much as log S ~ |0(1)| logfXo/L) [730]. Since we demand that 
L > Ljj, this is clearly way short of the total required to solve the entropy problem, 
log S ~ 88. In short, KK inflation does not last long enough to provide an alternative to 
scalar field driven inflation. 

To get the required expansion of 3-dimensional space, we must therefore include 
some additional scalar fields. Adapting |615j slightly, we can mimic a period of slow-roll 



inflation by plugging a cosmological constant, A, into the Field Equations (588)- (590 1 
Setting P = P = —p, where p — A/8itGd, we find 



f (n - 1) {H 2 + + f (n - 1) ( H 2 + £)+ nhHH = A, (597) 
f + (n-l)(H 2 + ^)+nHH=^ Tl (598) 
l + {n-l)(H 2 + §\+nHH=^ l , (599) 



Again, by taking the n dimensions to be flat (n = 0), and the h dimensions to be positively 
curved (k > 0) , we find a solution for which the flat directions grow exponentially, and the 
spherical dimensions remain fixed with H — at a = a* , where H 



2 _ n-1 k _ 2A 



n a% n(d—l) ' 

The radius of the extra dimensions lies at the maximum of its potential, so this solution 
is unstable. Indeed, fluctuations reveal that the spherical dimensions collapse to zero size 
over a time scale At ~ (1 + \fl + 8/n) / AH* , after which we cannot count on exponential 
growth in the flat directions. 

We now consider the phenomenologically interesting case of n — 3. To get the required 
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number of ~ 65 e-folds of inflation along the 3 flat directions we need H*At ~ 65. For 
n = 3, H*At ~ 0.729, so once again inflation is cut short far too early. We could imagine 
getting around this problem if we could alter the potential for the radius a, such that it 
develops a minimum as well as a maximum, by switching on fluxes [662, 661 . Generically, 
it is still very difficult to get enough exponential growth along the familiar 3 dimensions 
without causing the extra dimensions to grow alongside them |615j . For further details on 
the latest attempts to embed inflation in higher dimensional theories, we refer the reader 
to [5071 UMSllST^l 15551 12I51I5751H55] . Bounds on the variation of fundamental constants 
for dynamical compactifications have been studied in [571, 572 , while PPN parameters 
for KK models in the solar system were computed in |461j . Note that Kaluza-Klcin 
cosmologies have also recently been applied to the dark energy problem |119[ 1120] . 

We end our discussion of Kaluza-Klein cosmology by asking the question: Why are 
there 3 large spatial dimensions? We have already alluded to an anthropic explanation 
demanding the existence of stable planetary orbital To this we could add the existence 
of stable atoms and chemistry, both key to the development of intelligent life, and both 
requiring no more than 3 (large) spatial dimensions. 

Modern attempts at a dynamical understanding of the dimensionality of space include 
String Gas Cosmology [183111215] (for reviews see, e.g., [1801111611181] ). Here the spatial 
dimensions are taken to be compact and precisely 3 dimensions are allowed to grow 
large due to the annihilation of strings wrapping around those dimensions. The point is 
that strings winding around compact dimensions oppose their expansion since the energy 
of the string winding modes increases with radius. To allow the compact dimensions to 
grow large the winding modes must therefore collide and annihilate with the anti-winding 
modes. Generically, we would only expect collisions of 1 + 1 dimensional strings in at 
most 3 + 1 dimensions. Thus, the dimensionality of the string controls the dimensionality 
of space by allowing at most 3 spatial dimensions to grow to macroscopic scales. Note 
that this result is not spoilt by the inclusion of branes wrapping compact directions, as 
these happen to fall out of equilibrium before the strings [21] , 

Whilst this idea has some appeal at first glance, it has not stood up to intense 
scrutiny. More detailed quantitative analyses suggest that the desired outcome is not 
at all generic, and requires highly fine tuned initial conditions [4561 14571 1458] , Whilst 
one can engineer an anisotropic set-up allowing 3 dimensions to grow large as desired, 
typically the internal dimensions also grow to large sizes, just at a slower rate [456] 1457] , 
In fact, it turns out that either all dimensions grow large since the string gas eventually 
annihilates completely, or all dimensions stay small since the string gas gets frozen out 
[458] . There are also problems at the level of cosmology. For example, when properly 
calculated, the scalar perturbations have a blue power spectrum with n = 5, which is 
strongly ruled out by observations [670] . It has been argued that a near scale invariant 
spectrum can be obtained if the dilaton gets frozen during the strong coupled Hagedorn 
phase in the very early Universe 182 . However, such claims still rely on a semi-classical 
treatment of cosmological perturbations that cannot be trusted during the Hagedorn 
phase, as the strings are strongly interacting. 

In the context of 10 dimensional string theory, other attempts to explain the dimen- 



On the subject of planetary orbits, it is amusing to note that Kepler himself reasoned that the 3-fold 
nature of the Holy Trinity was responsible for the perceived dimensionality of space. Ptolemy is reputed 
to have offered some alternative ideas in his work On Dimensionality, but they have since been lost. 
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sionality of our Universe consider that for integer values of n, the inequality 2n < 10 =>■ 
n < 4 [4441 1683] . This is interesting because it means that the world volume of 3 + 1 
dimensional branes (known as 3-branes) are less likely to intersect than those of larger 
branes. In particular, Karch and Randall [683] have shown that an FLRW universe 
initially filled with equal numbers of branes and anti-branes will ultimately come to be 
dominated by 3-branes and 7-branes. This analysis accounts for the fact that larger 
branes dilute more slowly, as well as the likelihood of intersections and annihilations 
(hence the importance of 7-branes) . In a braneworld scenario, this could explain why we 
might be more likely to find ourselves living on a 3-brane, as opposed to a larger brane. 



The consequences of living on a 3-brane are discussed in detail in the Section 5.2 



Finally, we note that for toroidal compactifications, 3 large spatial dimensions can be 
linked to the stability of the small extra dimensions, at least in the presence of solitonic 
strings/branes that correspond to point masses in the large dimensions [462 . 



5.2. The Braneworld Paradigm 

The braneworld paradigm [TSl 110741 1521 [S3] represents a radical alternative to 
the standard Kaluza-Klein scenario, discussed in the previous section. In the KK sce- 
nario, the extra dimensions must be small and compact, the size of the internal space 
constrained by collider experiments to be below the inverse TeV scale. In the braneworld 
scenario the extra dimensions can be much larger, perhaps even infinite in extent. This 
is made possible by relaxing the assumption of universal extra dimensions. 

In the braneworld picture the Standard Model fields are not universal, rather they 
are confined to lie on a 3 + 1 dimensional hyper-surface, known as the brane, embedded 
in some higher dimensional space-time, known as the bulk. Tests of Standard Model 
processes can only constrain how far the brane may extend into the bulk, or, in other 
words, the brane thickness. They do not constrain the size of the bulk itself. Such 
constraints can only come from gravitational experiments, as gravity is the only force 
that extends into the bulk space-time. As is well known, on small scales gravity is 
much weaker than the other three fundamental forces, making it difficult to test at short 
distances. In fact, the gravitational interaction has only been probed down to ~ 0.1 mm, 
with torsion-balance tests of the inverse square law [TO]. It is too simplistic, however, to 
suggest that this translates into an upper bound on the radius of the bulk. Gravity is 
intimately related to geometry, and, as we shall see, one can warp the bulk geometry such 
that an infinitely large extra dimension is still allowed by experiment. For an excellent 
introduction to large extra dimensions see |1072j . 

Before delving into a detailed discussion of the various models, we note in passing 
that the braneworld paradigm is well motivated by string theory [622. 838, 52J. As well 
as fundamental strings, string theory contains fundamental objects known as D-branes 
[1044] . These are extended objects upon which open strings can end. The braneworld 
set-up therefore has a natural interpretation in terms of a stack of D-branes embedded in 
a higher dimensional target space (see Figure [8]). Open strings, with their ends attached 
to the D-branes, can be identified with the Standard Model fields bound to the brane. 
Only closed strings can propagate through the bulk, and these are identified with the 
gravitational interactions. 
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Figure 8: Taken from |51| . Here the brane has 3 macroscopic dimensions, and du compact dimensions. 
The open strings end on the D branes, whereas the closed strings propagate through the bulk. 



5.2.1. The ADD model 

The braneworld paradigm really began to gather momentum with the seminal work 
of Arkani-Hamed, Dimopoulos and Dvali |62j . in which the large extra dimension is 
exploited in order to explain the vast hierarchy between the electro-weak scale, Mew ~ 
TeV, and the Planck scale, M p i ~ 10 16 TeV. In this scenario the hierarchy does not go 
away completely, rather it is reformulated as a hierarchy between the scale of the extra 
dimensions, \l ~ 1/L, and the electro- weak scale. The set-up is as follows: Standard 
Model fermions and gauge bosons are localised on a 3 + 1 dimensional domain wall, in 
an (effective) D = 4 + n dimensional space-time. We should clarify that D counts the 
number of macroscopic dimensions in this scenario. Any microscopic dimensions, with 
characteristic size given by the fundamental Planck length, behave as in the standard 
KK scenario described previously. 

Now, the width of the wall cannot exceed the inverse TeV scale, as explained above. 
The bulk space transverse to the wall is compact but much larger than the width of 
the wall (L ^> TeV -1 ), so much so that we can treat the wall as an infinitely thin 3- 
brane. In the simplest construction, the bulk action is then described by the generalised 
Einstcin-Hilbcrt action 



where Md is the fundamental Planck scale in D dimensions, ^ab is the D dimen- 
sional metric with corresponding Ricci tensor, TZab, and Ricci scalar, 1Z — j ab TZab- 
The Planck mass is related to the fundamental Newton's constant in D dimensions by 




(600) 
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8ttGo = M^ D . At large distances, gravitational interactions along the brane are medi- 
ated by the graviton zero mode, which has a homogeneous profile over the extra dimen- 
sions. Truncating to the zero mode, we can compute the four-dimensional effective action 
describing long distance gravity along the brane by integrating out the macroscopic extra 
dimensions. This result in 

S cS = D 2 Vn Jd^x^gR, (601) 

where g^ is the four-dimensional metric on the brane, with Ricci tensor R^ V1 and Ricci 
scalar R = g^R^y, and where the volume of the extra dimensions is given by V n oc L n . 
The effective four-dimensional Planck scale, as seen by an observer on the brane, is then 
given by 

M pi ~ M 2 D n L n - (602) 

By taking the macroscopic extra dimensions to be sufficiently large, we can eliminate 
the standard hierarchy in D dimensions, Mjj ~ Mew ~ 1 TeV, and replace it with a 
new hierarchy involving the scale of the extra dimensions, fi ~ 1/L <C Mew- This is not 
in violation of short distance gravity tests, at least in D > 6 dimensions. Indeed, in six 
dimensions one can eliminate the hierarchy even for millimetre size extra dimensions. 

In actual fact, the strongest constraints on the ADD model do not come from short 
distance gravity tests, but from astrophysics and cosmology 63 . The problem arises 
because the Kaluza-Klein modes can be extremely light, ra^K > 1 / Z/ > 10~ 4 eV, and 
extremely numerous, Nkk ~ M^/Mp < 10 32 . This means that even though each mode 
is only very weakly coupled, with strength 1/M p i, scattering processes along the brane, 
at energies E > uikki can produce a copious number of KK gravitons. 

The strongest astrophysical constraint comes from the possible emission of KK modes 
during the collapse of SN1987a. Requiring this to not be the dominant cooling processes 
imposes a lower bound on the fundamental Planck scale. For example, with n = 2 we 
have Mr, > 50 TeV, whereas for n = 3 we have M D > 3 TeV [550] . 

In cosmology, one has to worry about over-production of KK modes at high temper- 
atures, since this may destroy the standard Big Bang picture. In order to be consistent 
with Big Bang Nucleosynthesis, and the current composition of the Universe, one must 
identify a maximum temperature for the early Universe for a given fundamental scale. 
Taking the fundamental scale to be Mr, ~ 1 TeV imposes a temperature bound T < 10 
MeV for n = 2, rising to T < 10 GeV for n = 6 |588| . A higher fundamental scale will 
raise the maximum temperature, but then one loses much of the appeal of the original 
model. While a low maximum temperature is not in contradiction with cosmological 
data, it does present a challenge to models of baryogenesis and inflation. The tem- 
perature bounds can be weakened considerably if one does not require the bulk to be 
flat. For example, when the bulk manifold is a compact hyperbolic space, the maximum 
temperature can be pushed beyond the GeV scale even for n = 2 |671j . 

5.3. Randall- Sundrum Gravity 

As we have already suggested, in a generic braneworld set-up there is no obvious 
reason why one should demand that the bulk space should be flat, as in the ADD model. 
In perhaps the most celebrated braneworld model, developed soon after ADD by Randall 
and Sundrum [10511 11052j . the bulk is an anti-de Sitter space. There are two versions 
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of the Randall-Sundrum model, generally referred to as Randall-Sundrum I (RSI) 1051 
and Randall-Sundrum II (RS2) [1052J. Somewhat confusingly, the RSI model contains 
two branes, whereas the RS2 model only contains a single brane. 

The RSI model |1051j was also proposed as a resolution to the hierarchy problem. It 
improves on the ADD model as the compact extra dimension need not be so large as to 
introduce a new hierarchy. This is achieved by exploiting the exponential warp factor to 
generate a large bulk volume from a small compactification radius. In contrast, the RS2 
model [1052 contains a single brane and a non-compact extra dimension - it is infinite 
in extent. This time the bulk warp factor ensures that gravity is localised close to the 
brane, so that a brane observer only sees the gravitational effects of the extra dimension 
above scales set by the bulk curvature. 

Although the phrase Randall-Sundrum gravity really refers to these two original mod- 
els, here we extend the definition to include any model with similar features. In this 
section we are particularly interested in five-dimensional models containing 3-branes, 
with some non-trivial geometry , or "warping" , present in the bulk. We begin with an 
overview of some of the models. For further details see, for example, 1008, 845 . 

5.3.1. The RSI model 

In RSI, we have two 3-branes separated by a region of five-dimensional anti-de Sitter 
space 1051 . The branes are located at z — 0, and z — z c , and we impose Z2 symme- 
try across each brane. Neglecting Gibbons-Hawking boundary terms [536] . the action 
describing this model is given by 



M 3 

S=^ I d 4 x I dzy/=^ (R - 2A) 



2 



0"+ 



z=0 



d 4 x\/-5 (+) - <r- / d A xyJ-g(-) , (603) 



where j a b is the bulk metric, and gj^/ and g^J are the metrics on the branes at z = 
and z = z c , respectively. M5 is the five-dimensional Planck scale and is related to the 
five-dimensional Newton's constant via the standard relation G 5 = l/87rM|. We also 
include a negative bulk cosmological constant, A = —6/ 1 2 . If we fine-tune the brane 
tensions against A, such that 



<7 + = — CT_ 



6M| 



I 4tvGrV 



then we admit a background solution in which the branes exhibit four-dimensional 
Poincare invariance: 

ds 2 = e'^^n^dx^dx" + dz 2 , (604) 
for — z c < z < z c . The Z2 symmetry about z = is explicit, whereas the other boundary 



condition should be understood to be implicit. The line-element given in Eq. (604) 
contains an exponential warp factor that is displayed graphically in Figure [9] In between 
the branes we recognise the geometry to be anti-de Sitter space, written in Poincare 
coordinates. Notice the peak in the warp factor at the positive tension brane, and the 
trough at the negative tension brane. Although only a toy model, the RSI set-up is well 
motivated by a number of string theory /super-gravity constructions |622L 18381 1439j . 

182 



Figure 9: The behaviour of the warp factor in the RSI model. 



By integrating out the 4D zero-mode we are able to derive the 4D effective Planck 
scale on a given brane [1051L 11008] : 

M% = ±M|Z (l - e T2zJl ^j , (605) 

where ± labels the sign of the corresponding brane tension. In terms of the effective 
Newton's constants we have 

G± = ^ ( ^Ljt) • (606) 



Now suppose we live on the negative tension brane. If we take the fundamental Planck 
scale M 5 ~ TeV, the bulk curvature scale to be just below 1/Z ~ O.OIM5, and the 
distance between the branes to be such that z c /l ~ 35, we recover the desired effective 
Planck scale, M_ ~ m p i ~ 10 16 TeV. Thus, the hierarchy problem has been eliminated 
altogether, and not just shifted around, as in the ADD model. In contrast, the hierarchy 
is not eliminated if we live on the positive tension brane since then the effective Planck 
mass is given by M+ - e~ z ^ l M_ [TUFTl [TUUg] . 

As it stands, the RSI model is incomplete. The problem is that on either brane the 
low energy 4D effective theory is not GR, but Brans-Dicke gravity. The extra scalar 
comes from fluctuations in the brane separation, and is sometimes referred to as the 
radion |541| 1528] 1275) . The value of the Brans-Dicke parameter depends on the brane, 
and is given by [528] 

(±) 3 
*b6 = 2 



Observations require this parameter to be large (wbd > 40000, see Section 3.1.2). Note 



that for the positive tension brane can be made arbitrarily large with increasing 

brane separation. The same cannot be said for w B ^ on the negative tension brane. If 
we want to live on the negative tension brane we must generate a mass for the radion to 
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Figure 10: The behaviour of the warp factor in the RS2 model. 



suppress its fluctuations. The Goldberger-Wise mechanism does exactly that, and thus 
stabilises the distance between the brancs 



5.3.2. The RS2 model 

The RS2 model is obtained from RSI by taking the negative tension brane off to 



infinity [1052] . The geometry is then described by the metric in Eq. (604) with z c — > oo. 
The corresponding warp factor is shown in Figure [TO} the single peak at z — indicating 
that we have a single brane with positive tension. In this limit M + — > M§1, so we cannot 
eliminate the hierarchy problem as in the RSI model. Rather, the situation is more akin 
to the five-dimensional ADD model, with the curvature scale, l/l, playing the role of the 
compactification scale, fi ~ 1/L. 

What makes the RS2 model interesting is the way in which AD GR is recovered on the 
brane. As we have just seen, in RSI the observer on the positive tension brane sees a low 
energy gravity theory corresponding to Brans-Dicke gravity, with a BD parameter — 
| (e 2Zc / 1 — l). In the RS2 limit of infinite brane separation the BD scalar decouples 
and one is left with AD GR. As we will see in detail in Section |5.3.5| even though the 
bulk is infinite in extent, gravity is localised on the brane at energies below the bulk 
curvature scale, l/l. As a method to screen the extra dimensions from the low energy 
observer, this represents a radical alternative to the standard method of Kaluza-Klein 
compactifications . 

The key to gravity localisation is that the bulk volume is finite even though it has 
infinite extent. This ensures that there is a normalisable graviton zero mode, which, in 
the absence of any other massless modes, guarantees AD GR at low enough energies. 
The finite bulk volume arises because the warp factor falls off exponentially as we move 
away from the brane. Intuitively, gravity localisation occurs because the warping makes 
it difficult for the graviton to propagate too far away from the brane, so much so that 
the region of the bulk with z > I has no influence on low energy brane interactions. 

Since l/l sets the scale at which the brane observer starts to become sensitive to the 
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bulk, table top experiments of the inverse square law impose the limit l/l > 10 -4 eV. This 
translates into a lower bound on the fundamental Planck scale, M 5 > 10 5 TeV, which 
is well above the electro-weak scale. Since TeV scale gravity is not phenomenologically 
viable in this case, we abandon any discussion of the hierarchy problem for the single 
brane scenario. 

Up until now our discussion has centred around weak gravity on the brane. What 
about strong gravity in the presence of localised sources? Whilst there have been some 
interesting numerical studies (see, e.g., [12781 1758] ) exact strong gravity solutions are 
rare in RS2. One exception are the solutions for a domain wall localised on the brane 
[5611 1560] . In contrast, an exact solution describing a braneworld black hole remains 
elusivj 4 ^) The difficulty arises because the brane is an accelerated surface, so any black 
hole residing on the brane must follow an accelerated trajectory. Such an accelerated 
black hole requires knowledge of the AdS C-metric, but this solution is unknown in five 
dimensions. For a nice review of the search for braneworld black hole solutions see |558j , 
and more recently [1200] . 

RS2 and AdS/CFT 

Emparan, Fabbri and Kaloper (EFK) have suggested that a static braneworld black 
hole does not exist |470j . To understand their argument we must first recall the AdS/ CFT 
correspondence in which type IIB string theory on A0IS5 x S* 5 is conjectured to be dual to 
Af = 4 SU(N) super- Yang-Mills, in the large N limit [862] . This suggests an alternative 
description of RS2 gravity |570j : 'Gravity on an RS2 brane is dual to a strongly coupled 
conformal field theory (CFT) cut off in the UV, minimally coupled to AD gravity'. There 
is plenty of evidence for this holographic description of RS2 (see, for example, [5701 
fTTTOl HSU ITUM1 mm ISM Him IMS], and for a review Another way of 

describing the correspondence is to say [470] : l A classical source on the brane is dual 
to a quantum corrected source in four dimensions, with the quantum corrections coming 
from the strongly coupled CFT . The quantum corrections are large because of the large 
number of degrees of freedom in the large- AT limit. When applied to the problem of 
finding a black hole on the brane, this suggests that the solution should not be static 
since it should include the back-reaction of the Hawking radiation 470 (see also [1201] 'i . 
Fitzpatrick, Randall and Wiseman have disputed this interpretation, pointing out that 
the CFT is strongly coupled and may therefore carry fewer degrees of freedom [508J. 
At this point it is fair to say that as yet there has been no consensus, and the subject 
remains an active area of debate (see also [5641 15621 19661 15051 1506] ). 

Note that if we accept the EFK conjecture, we can improve the bound on the bulk 
curvature by an order of magnitude, l/l > 10 -3 eV. This is based on the existence of 
long-lived black hole X-ray binaries [471] , For smaller values of l/l these binaries would 
have already decayed. 

5.3.3. Other RS-like models 

One of the characteristic features of the Randall-Sundrum models is the structure of 
the bulk geometry, described by a non-factorisable, or warped, metric. One can embrace 



There have been many attempts to find such a solution, most of which remain unpublished. 
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this structure and consider a whole slew of interesting generalisations. Here we consider 
a class of models described by warped geometries of the form 



ds z = a\z)g tIiV (x)dx^dx v + dz z . (608) 

Many of the most interesting RS-like models exhibit quasi-localisation and give rise to 
large-distance modifications of gravity. Perhaps the most celebrated of these is the DGP 



model [454], which will be discussed in detail in Section 5.5 Other interesting examples 
include the GRS model |563j , the asymmetric brane model [10121 11013] and the CGP 
model [274] . We begin with the simplest generalisation, however, proposed by Karch and 
Randall [552] (see also [567] ). 

The Karch-Randall model 

Here we take the RS2 model and de-tune the brane tension, <j + ^ 6M|//, so that one 
no longer has Poincare invariance along the brane. For an excess tension, <r+ > 6M|/2, 
the metric g^ v is de Sitter, whereas for a tension deficit, a + < 6M§/1, g^ v is anti-de 
Sitter. The effective cosmological constant on the brane is given by 



A 4 = 3 



1 



6M| J P 



(609) 



The behaviour of the warp factor also changes. We find that 



a(z) = A cosh(c ~ \z\/ 1), c = cosh 1 (1/A), for anti-de Sitter, (610) 
a[z) = Asinh(c- \z\/l), c = sinh _1 (l/A), for de Sitter, (611) 



where A = ly/\h±/Z\. The decay of the warp factor away from the brane is greatest 
for the dS brane, cutting off the space-time at z = Ic. This means that gravity is 
more strongly localised than in the standard RS2 scenario. In contrast, for the AdS 
brane the warp factor turns around at some finite value of z. This means the bulk 
volume is infinite, and gravity is not localised at all on the AdS brane as there is no 
normalisable zero mode. Actually, when | A.4 1 is small (compared with A/5), the AdS 
brane exhibits quasi-localisation. This is because there is a normalisable mode that is 
ultra-light, with mass m^Ura-light ~ A|/M 5 <C | A4 1 |1120j . At intermediate energies, 
k ^> E ^> m u itra-iight, the light mode behaves as if it were effectively massless and one 
recovers 4D gravity. Note that there is no issue with the vDVZ discontinuity in AdS 
when rn^ ltra _ light /\\A4\ < 0.1 |721] , We say that gravity is only gwasi-localised because 
the extra-dimension opens up in the far infra-red, at energies E < m u itra— light- 
Now consider what happens when we introduce a second AdS brane, along the lines 
of an AdS generalisation of the RSI model |720j . The first thing to note is that both 
branes can have positive tension since the warp factor turns around. The second thing 
to note is that the bulk volume is rendered finite, and so we have a zero mode as well 
as the ultra-light mode. At energies above the mass of both modes gravity is mediated 
by the exchange of two spin-2 fields, one massless and one massive. This corresponds 
to a braneworld realisation of the bigravity scenarios discussed in Section |3.3| (see also 
[7231 r722l fTTTTT] ). 
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The GRS model 



The GRS model |563| was developed by Gregory, Rubakov and Sibiryakov. The set- 
up contains three Minkowski branes, one with positive tension and two with negative 
tension. The positive tension brane is Z2 symmetric and is flanked on either side by a 
section of anti-de Sitter space as far as a negative tension brane. Beyond the negative 
tension brane lies an infinite region of Minkowski space. The warp factor goes like 563 

I e Zc/l \z\ > z c . 

As the bulk volume is infinite, gravity is not localised, although the decaying warp fac- 
tor around the positive tension brane gives some degree of quasi-localisation. Again, 
although there is no zero mode, there is an ultra-light mode, and the extra dimension 
opens up at very large distances. Unfortunately, the GRS model is known to be unstable 
due to the presence of a ghost in the spectrum of linearised fluctuations [I038J. 

The asymmetric brane model 

The asymmetric brane model [10121 [T0T3] (see also 1189 ) is a single brane mode,l 
like RS2, only without Z 2 symmetry imposed across the brane. Indeed, the fundamental 
parameters in the bulk are allowed to differ on either side of the brane, including the 
bulk cosmological constant and the bulk Planck scales. Allowing the bulk Planck scales 
to differ might seem strange, but not if we imagine a string compactification down to 
five dimensions in which the dilaton is stabilised at different values on either side of a 
domain wall (the brane) . If the bulk cosmological constant and Planck scales are given 
by 

A=j- 6 ^ Z> \ M={ Ml Z> ° (613) 

[-6/1% z<0 [M 2 z<0. 

The asymmetric model then admits Minkowski branes for a suitably tuned brane tension, 
a = 3(eiMf/li + £2^2/^2)7 where t\ = ±1 and e 2 = ±1- The corresponding solutions 
have a warp factor of the form [1013] 

The parameters t\ and £2 control whether the warp factor grows (e = —1) or decays 
(e = —1) away from the brane in a given direction. The model includes RS2 as a special 
case. 

It is, however, more interesting to consider the case where one of the warp factors 
grows away from the brane while the other decays (e.g. £2 = — ei = !)• The bulk volume 
is then infinite so that there is no zero mode, but by choosing the scales appropriately 
one can engineer a degree of quasi-localisation. The point is that on the growing side the 
graviton localises at the AdS boundary, where its sees an effective AD Planck scale Mfl\. 
By taking this scale to be very large, the effect of localisation close to the AdS boundary 
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is almost decoupled from the gravitational dynamics near the brane. We should note that 
the asymmetric model also admits self-accelerating solutions, just as in the DGP model. 
In fact, the model shares a number of features with the DGP model. This is no coinci- 
dence, since the DGP model can be obtained as a limiting case of the asymmetric model 
|736) . It is known that the self-accelerating solutions of DGP contain ghost-like insta- 
bilities |273[ 110151 1559[l546] . so the same is expected to be true for the asymmetric model. 

The CGP model 

The CGP model [274] . developed by Charmousis, Gregory and Padilla, also exhibits 
quasi-localisation. It combines the main features of both the asymmetric model and 
the DGP model in that there is an induced gravity term, and asymmetry across the 
brane. Interestingly, the model contains a new type of cosmological solution that tends 
to Minkowski space at very late times, but undergoes an intermediate period of cosmic 
acceleration in the presence of ordinary matter. In fact, it corresponds to a braneworld 
realisation of the Cardassian cosmology, with H 2 ss ^f^{p + c \fp) [512) . Although a 
ghost is present when we introduce a small positive vacuum energy, this decouples in the 
Minkowski limit [710] . 

5.3.4- Action and equations of motion 

In each of the models described in our overview, we have a five dimensional bulk 
space split into a series of domains separated from each another by 3-branes. Here we 
consider the action and field equations for generic models of this type. The 3-branes may 
be thought of as the boundaries of the various domains so that the action is given by 

S= [ d 5 x^l (^-n + £ hulk ) + V / d 4 x^ [-A (MlK) + £ brano ] , 

./bulk V ^ ) brane8 ^braae 

(615) 

where 7 a f, is the bulk metric with corresponding Ricci scalar, 1Z, M5 is the bulk Planck 
scale, and £buik is the Lagrangian density describing the bulk field content. In principle 
both M5 and £buik can vary from domain to domain. For each brane g^ v is the induced 
metric and £brane is the Lagrangian density describing the field content on that particular 
brane. K = g^ v K^ v is the trace of extrinsic curvature, K^ u . This should be evaluated 
on either side of the brane as it can differ from side to side. Labelling the two sides of a 
given brane, using L and R, we define K^l^r — \C n \ L R Q\w, i- e - extrinsic curvature is 
given by the Lie derivative of the induced metric, with respect to the unit normal n a \L,R. 
The unit normal on both sides points from L to R. Note that what appears in the action 
is the jumpp'l 

A (MlK) = MlK\ R - M%K\ L . 

This corresponds to the Gibbons-Hawking boundary term [536) for the bulk domains 
on each side of the brane. The A here is not to be confused with the 3-dimensional 
Laplacian. 

Now there are two (completely equivalent) ways to treat the brane contributions at 
the level of the field equations. One approach is to treat them as delta-function sources in 



Henceforth we define the jump of any quantity Q across a brane as AQ = Q\r — Q\l- 
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the Einstein equations. However, our preferred approach is to explicitly separate the field 
equations in the bulk from the boundary conditions at the brane. The bulk equations of 
motion are then given by the bulk Einstein equations 

Ga b = n ab - l -n lab = ^r Q b b ulk , (6i6) 

where T^" lk = — J 2 — /bulk ^ 5x V ~ is the bulk energy-momentum tensor. The 
boundary conditions at are given by the Israel junction conditions 649 

A [M 2 5 (K^ - Kg,,)} = -T M b r°, (617) 

where T$ = -^fb^j^ Jbranc ^ 4a; \/~5'Cbrano is the brane energy-momentum tensor. 
Note that in each of these examples, the bulk geometry is only sourced by a cosmological 
constant, T a bulk = -M|A 7ab . 

5.3.5. Linear perturbations in RSI and RS2 

We now consider the theory of linear perturbations. For brevity, we will restrict at- 
tention to RSI and RS2, although the methods we use are fairly standard, and should 
apply to all RS-like models (for further details, see [5^51 [Tf5l [TTJ551 fTTJTTj V 



Weak gravity on a RSI brane 



It is enough to consider RSI, as RS2 can be readily obtained by taking the negative 
tension brane to infinity. Recall that the background metric, j a b, is given by Equation 



(604|, with the positive tension brane (the "+" brane) fixed at z = and the negative 
tension brane (the "— " brane) fixed at z = z c . We see that the induced metric on the 
"+" brane is given by gj^/ — rj^, and on the "— " brane by gj^) = er 2Zc / l ri, v . Note 
that we have Z2 symmetry across the branes, so we can restrict attention to < z < z c . 

We now consider small perturbations about the background, so that the metric is 
given by "f ao — j a b + Sj a b- It is convenient to choose Gaussian Normal (GN) gauge, 
defined by 

*V = hzz = 0. (618) 

Actually, this is only a partial gauge fixing. Since we have no additional bulk matter, 
we can also take the metric to be transverse and trace-free in the bulk. In other words, 
<hv = X/iv(x,z), where 

d»X» = x£ = 0. (619) 
This is known as Randall-Sundrum (RS) gauge |1052j . In RS gauge, the linearised bulk 



equations of motion, 8Q ao = j^Sjat, yield 



^ dz 2 P 



X»v = 0, (620) 



where d 2 — d^d* 1 . 

Unfortunately, we can no longer assume that the branes are fixed at z = and z = z c . 
The presence of matter on the branes will cause them to bend |528) so that they will now 
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be positioned at z — f + (x) and z = z c + /_ (x) , for some functions f± that depend only 
on the coordinates x^. This makes it difficult to apply the Israel junction conditions 
at the branes. To get round this we can apply a gauge transformation that fixes the 
position of the "+" brane, and another that fixes the position of the "— " brane |275j . 
without spoiling the Gaussian Normal condition (618). This gives rise to two coordinate 
patches that are related by a gauge transformation in the region of overlap. We will call 
them the "+" patch and the "— " patch accordingly. 

We first consider the "+" brane. To fix its position we make the following coordinate 
transformation 

z^z-f+(x), x^x^+ l -(l-e- 2z / l )d^f + . (621) 

The "+" brane is now fixed at z = 0, although the other brane is now at z = z c +f- — /+• 
It follows that the metric perturbation in the "+" patch is given by 

*V = X$(x,z) = X„u(x,z) - 1(1 - er 2z ' l )d,d v f + - jf+%„. (622) 

Similarly, to fix the position of the "— " brane we let 

z^z-f.(x), XlM ^ Xtl + l(l- e -^- z °y i )d^f_. (623) 

Now we have the "— " brane at z = z c , but with the "+" brane at z = f + — /_. The 
metric perturbation in the "— " patch is given by 

*V = X${*,*) = X»u(x,z) e - 2 (^)/ ; ) M a„/_ - jf-% v . (624) 

Now the induced metric on the "+" brane is given by g^/ = g^} + Sgffi , where 

SgiV = xiV(x, 0) = x„u(x, 0) - jf+g$>, (625) 

whereas on the "— " brane it is given by g\u) — gjxu + Sgjxu , where 

2 

Sg^v = xi~v>(x,Zc) = X^u{x,z c ) - (626) 

We are now ready to make use of the linearised Israel junction conditions given by Eq. 
(6171 at each brane to find 

A [M 2 5 S(K, U - Kg^)} (±) = * ± 6g$ - T$\ (627) 

where a± — ±6M§/1 is the tension on the "±' brane, and is the energy-momentum 
tensor for matter excitations. Now, owing to the I2 symmetry, the extrinsic curvature 
simply changes by a sign when evaluated on either side of a given brane. It follows that 
the linearised boundary conditions at each brane are given by 

d 2 X 



dz I 
d_ 2 
dz + 1 



X^v 
Xiiv 



= -S { +\ (628) 
= Sj~\ (629) 



190 



where 



r(±) - -TfiinW 



(630) 



and where = g?±\7pv is the trace of the appropriate energy-momentum tensor. 

Indeed, taking the trace of these equations, and using the fact that Xnv is traceless, we 
clearly see that matter on a brane causes it to be bend, such that 



4 



6M 5 3 ' 



-z c /l 



(631) 



). Equations 



where a± gives the warp factor at the "±" brane (i.e. a + = 1 and a_ 
(620), (628) and (629) give the governing differential equations, and a complete set of 
boundary conditions for the graviton mode x^v- We now take Fourier transforms along 
the brane directions, Q(x, ...)—> Q(p, ■ ■ •) = j^ttW I ^ x e~ lp,lX Q(x, . . .), to find that 



Xp,v (x, z) 



d A p e lp ^x^(p,z), 



where 



and 



d_ 2 

~dl + 7 



Xjiv 



o 2 e 2z ' 1 + 



= -sl?( P ), 



dz* ~ P 



d_ 2 

Yz + 1 



= -s ( -\p)- 



This system is easily solved to give 

X»u{p,z) = CW{p,z)S$-Ct-)(p,z)&-\ 

where 

1 



C {+ \p,z) 
and 

C7<->(p„ 



pdet A(p) 



h 



pAetA(p) 

Note that /„, and K n are modified Bessel's functions of integer order n [BJ, and 
det A{p) = h (pi) K x (ple z ' /l ) - K x (pi) h (ple z ^ 



From Eq. (631) we also have 



/±(*) = ± 



1 a z ± 
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d 4 p e tp » x ' 



6p ; 



2 ' 



(632) 
(633) 

(634) 

(635) 



h (ple*°^ K 2 (ple*' 1 ^ + K x (pie*'' 1 } I 2 (ple z ^ , (636) 



(pi) K 2 (ple z ' 1 ) + Ki (pi) h (ple z/l )\ . (637) 



(638) 



(639) 



To compute the metric perturbation on each of the branes we simply use Eqs.(622| and 
( 624 1 , given our knowledge of Xuv {x, z) and f±(x). At the positive tension brane we then 
have 



y (+) 



1 1 



d* P e»»" {c(+)(p,o) - ^r^0 

- pure gauge terms, (640) 



-z c /l 



lp 2 det A(p) 

whereas at the negative tension brane we have 
1 1 



f (-) _ If (-)g(-) 



Y (-) 



(2tt) 2 M| 



e -z c // 

det 



f (+) - If(+)s(+) 



pure gauge terms. (641) 



The parameters a± are crucial as they control the tensor structure of the propagator on 
the "±" branes. They are given by 



1 



1 



Zp 2 C(+)(p, 0) 



-2z c /Z 



1 



lp 2 C(-)(p,z c 



(642) 



Using the properties of modified Bessel functions [BJ, we can show that at low energies, 
p < ke~ z " /l , we have |528j 



y (+) 



(2tt) 2 



M\p 2 
2 



f (+) - 

' /if 



2^ + 37 



<<& 3 



Mlp 2 



f (-) _ If (-)g(-) 



' {IIS 



pure gauge terms, (643) 



and 



y(-) 



M 2 p 2 
2 



f(-)_ 



W BD + 1 
2w BD + 3 



M\p 2 



f (+) _ If(+)fl(+) 



pure gauge terms, (644) 



where M± and are the AD effective Planck scale and Brans-Dicke parameter on the 
"±" brane, respectively. They are given by Equations (605) and (607 1. We can now see 
explicitly how BD gravity emerges as the low energy effective theory in RSI, on both 



branes, as claimed in Section 5.3.1 
Graviton spectrum 



Let us now pause to comment on the mass spectrum for the graviton. The spectrum 
can be obtained by identifying the poles in the propagator. These are given by the 
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solutions to p 2 — —m 2 , where detA(p) = 0. This gives 



h (iml)K x {imle z " /l ^j - K x (imZ) h (imle Zc,l ^j = 

=^> Ji (ml) Yt {mle z " /l ^j - Y 1 (ml) J x (mle Zc ^ = 0, (645) 

where J\ and Y\ are Bessel's functions of order one. By solving this equation we see 
that there is a zero mode, and a tower of heavy Kaluza-Klein modes with mass splitting 
Am - l/l(e z °/ 1 - 1). 

Radion effective action 

The finite Brans-Dicke parameter indicates the presence of a massless scalar in ad- 
dition to the massless graviton. This is due to the radion, or brane bending mode, 
Sf = /+ — /_. In some RS-like models (e.g. GRS), the radion can exhibit pathological 
behaviour that can only be revealed by computing the effective action 103Sj. We will 
now briefly outline the procedure for doing this. 

Let us consider vacuum fluctuations (T^ — 0) in the scalar sector. The first thing 



to note is that the Field Equations ( 620 ) , ( 628 ) and ( 629 ) admit a solution of the form 



X^ d) =-^e 22/i <W, (646) 



where d 2 ip — and the vacuum boundary conditions require 

I 



f± c = -wzr^ ( 647 ) 



2a 2 ± 



where /™ c is the vacu um fluctuation in f±. Note that Equati on ( 631 1 implies d /™ c = 0, 
and so Equation (647) is consistent with d 2 ip — 0. Equation (647) also imposes a relation 



between /™ c and fl ac , resulting in a single free scalar degree of freedom, which we take 
to be the physical radion mode, ip. Note that the radion profile in the bulk is localised 
close to the "— " brane, in contrast to the graviton zero mode which is localised close 
to the "+" brane. We now work in the "+" patch, which has the branes positioned at 
2 = and z = z c — 8f vac . Focusing solely on the scalar sector, the metric perturbation 
is given by 

8^u = -je^I'd^ - 1(1 - e- 22/i )cW; ac - y/; ac >., h»z = 8 lzz = 0. (648) 

In order to integrate out the extra dimension, it is convenient to have both branes fixed. 
We can do this with the following coordinate transformation 

z -> z + B(z)Sr c , x, -> x, ~ e- 2z ' l d^(5r c ) f e 2 y' l B(y)dy, (649) 



where B(z) is some differentiable function for < z < z c , satisfying B(0) = and 
B(z c ) = 1. While this transformation ensures that S^/^z is still zero, the price we pay for 
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fixed branes is that we now have non- vanishing 8^j zz . More precisely, 



^%d v r/> - 1(1 - e- 2z ' l )d^f™ c - - [/; ac - (dr c )B(z)] % v 



+2e- 2 ^ l d^(Sf vac ) / e 2 y/ l B(y)dy 



h*z = K z = -2Sf vac B'(z). 



To quadratic order, the effective action is given by 

6 



S e S = 



Ml 



dz 



d A Xy 



-7 h ab 8 



Qab j2 Tofc 



l 

2 ./ z= o 



d i x^/--g(+) A (M 2 S(K^ - Kg^)) 



(+) 



+ - / d*i A /=gF) hT [A (M 2 5 5(K^ - Kg^)) 



(-) 



fl V 



(650) 
(651) 



(652) 



It turns out that 6 [G^v ~ jzl^u] and A (M|<5(if M „ — Kg^))^ — cr±h fll , are identically 
zero [1038 . After integrating out a total derivative in z, we arrive at the following 
effective action for the radion: 



•Sradion — ^ 



d 4 x (d^) 2 , 



(653) 



where M_ is given by Equation (6051. Thus, as expected, the radion, -0, behaves as 
a massless scalar, and, being localised close to the "— " brane, its coupling strength is 
controlled by the scale M_. As we move the branes further and further away from one 
another, M_ increases and the radion starts to decouple (decoupling completely in the 
RS2 limit). Note that in some models, such as GRS 563], the radion effective action 
comes in with the wrong overall sign, signalling the presence of a physical ghost |1038j . 
There is no such pathology in RSI or RS2. 

Weak gravity on a RS2 brane 

Let us now focus on RS2 by taking the negative tension brane to infinity. As z c — > oo, 
the metric on the remaining positive tension brane is given by 



Y (+) 



(2k) 



d p e 



K 2 (pi) 
MlpKx (pl) 



<f(+) 



1 + 



Kijpl) 
plK 2 (pl) —(+) (+) 

2 ' y 



pure gauge terms. (654) 



At low energies, p <C k, we again use the properties of modified Bessel functions [6] to 
show that 



Y (+) 



1 



d A p e ip »* 



M 2 p 2 



fi+) _ 1 f-(+)fl(+) 
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pure gauge terms. 



(655) 



It follows that gravity is indeed localised on the brane at low energies, with a 4D effective 
Planck scale Mj = M|i, or, equivalently, with an effective Newton's constant G4 = G5/I. 
Note that the tensor structure of the propagator matches that of General Relativity, with 
a factor of — | in front of the trace term. This is where the brane bending mode f+{x) 
plays a crucial role. It cancels part of the graviton zero mode in just the right way 
to guarantee good agreement with solar system gravity tests. It is worth noting that 
this neat cancellation of terms does not always happen, even in single brane scenarios. 
We will see this in DGP gravity, for example, where one has to argue for some sort of 
Vainshtein effect to pass observational tests. 

Let us finally consider massive modes. In RS2 there is actually a continuum of 
massive modes, consistent with the fact that the extra dimension is no longer compact. 
However, as we have seen, we still recover AD GR to leading order. The next to leading 
order corrections are obtained by integrating over the continuum, or, equivalently, by 
considering the next to leading order expansion in pi above. The result is that the 
Newtonian potential reads^] 

v (r)* J(i+^+oa 3 A- 3 )). 

We refer the reader to [5251 1TUU51 1355] for details of the derivation. Note that the 1/r 3 
correction can be obtained in a dual picture as the one loop CFT correction to the 
graviton propagator |438j . 

5.4- Brane Cosmology 

There are two obvious reasons why cosmology offers an interesting arena in which 
to develop the braneworld paradigm. The first is that cosmological branes possess a 
high degree of symmetry, and this makes it possible to solve the field equations. The 
second is that cosmological physics can be tested by a number of observations, ranging 
from supernova data to the abundance of light elements. In this section we will study 
the cosmology of co-dimension one branes, focusing on the RS2 scenario with a single Z2 
symmetric brane. We will review the background dynamics [159, 178 before moving on to 
cosmological perturbation theory |716[I1244] , Further details, including generalisations to 
multi-brane scenarios with bulk scalar fields, can be found in the following review articles 
[TU251 15431 15551 17701 IT591 1T93) (see also [TU051 IM51 1M71 HT5J])- Other generalisations 
include: the cosmology of branes without Z 2 symmetry [10121 [TT591 11143) : anisotropic 
braneworlds [843, 966 ; and branes for which energy is explicitly transferred between 
bulk and brane 6!) 1 695 1234 . Note that any anisotropy is seen to dissipate on the 
brane in RS gravity, a feature that can be identified with CFT particle production in the 
holographic picture |966j . Bulk-branc energy transfer has been used to account for dark 
energy [SMI 15931 IT233] . 

Braneworld cosmology can be studied using two different formalisms: The brane based 
formalism, and the bulk based formalism. These two approaches are completely equivalent 
and yield a background cosmology governed by the following Friedmann equations [159[ 



Note that this result differs slightly from the original one quoted in 1052 . 
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(656) 
(657) 



where i? = a/a is the Hubble parameter along the brane, a(t) is the scale factor, and 
k — 0, ±1 describes the spatial curvature. The brane is sourced by a tension, a, and 
a cosmological fluid with energy density, p(t), and pressure, P(t). The parameters A4 
and G 4 denote the effective cosmological constant and Newton's constant on the brane, 



respectively. As in the standard scenario, the Raychaudhuri Equation (657) follows from 



the Friedmann Equation (656) and energy conservation, 

p + 3H(p + P) = 0. 



(658) 



We will now review the derivation of this cosmology using the two equivalent for- 
malisms. For the moment, however, let us comment on a few of its important features. 



From Equations (|656|) and (|657|) we see that the corrections to the standard cosmology 

" Pweyl • 



manifest themselves in a term 



and a dark radiation term 



The 



latter corresponds to a non-local "Weyl" contribution and can only be fixed by specifying 
the bulk geometry. In the holographic description of RS2, the p 2 corrections contribute 
to the conformal anomaly |1138j . while the dark radiation is identified with thermal 
excitations of the CFT [5701 ITHM HThM fTHTil . 

Both corrections will strongly affect Big Bang nucleosynthesis (BBN), so their mag- 
nitude can be constrained by the abundance of light elements. These require that the 
dark radiation can be at most 10% of the photon energy density, p wcy i/ p-y < 0.01, in the 
period following BBN [638] . In addition, the p 2 corrections to the cosmological evolution 
should be negligible after BBN, which imposes a constraint! 4 ^] on the tension a > (MeV) 4 

[US]. 

The p 2 corrections play the dominant role in the very early universe and will have a 
big impact on the inflationary dynamics. Assuming the inflaton is confined to the brane 
along with all the Standard Model fields, one finds that the slow roll parameters are 
given by |847] 
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where V(cf>) is the inflaton potential. At low energies, V -C cr, these match the standard 
formulae of AD General Relativity, 



1 

16ttG 4 



V ~ Vgr 



8itGa 



v 



48 Assuming a RS2 scenario with fine tuned tension, a = 12Afg/Z, the BBN co nstrain t is significantly 
weaker than the constraint arising from table top experiments quoted in Section |5,3,2 i.e. l/l > 10 — 4 
eV, giving a > (100GeV) 4 [l89]. Here we have used the fact that Mf = Mgl ~ 10 "TW 
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However, at high energies we have 



4ct 2cr , 

e ~ eGH— < e GR , rj ~ tj gr — < ?7 Gfl . (661) 

This means we can get away with steeper potentials in the braneworld case [847J . This 
is essentially because Hubble friction gets enhanced by the p 2 terms. However, such 
potentials are incompatible with observational constraints as they lead to a large tensor- 
to-scalar ratio 



5.4-1- Brane based formalism - covariant formulation 

We shall now derive the background cosmology, Eqs. (656) and (657), using the 
brane based formalism. This makes use of the Gauss-Codazzi equations 53 fl I320j . and 
the Israel junction conditions to derive the Einstein tensor on the brane |1140j . We 
will assume Z2 symmetry, although we refer the reader to |117j for a non-Z2 symmetric 
generalisation. 

The Gauss-Codazzi equations are the fundamental equations of embedded hyper- 
surfaces. The brane can be thought of as an embedding, 

x a = X a (^), (662) 

in the bulk geometry, J ab (x). By Z2 symmetry this is the same on both sides of the 
brane. We can now define tangent vectors V£ = <9X a /d£ M , and the outward pointing 
unit normal, n a , satisfying 

a b -1 a t rb n 

-f ab n n =1, -f ab n = 0. 

It follows that the induced metric on the brane is given by 

9^ = iab{X)V«Vt- (663) 

For such an embedding, the Gauss-Codazzi equations [5311 1320j give 

iW = K abcd V«V*V°V$ + K^Kvp - K^K^, (664) 
V^K^-Kg^) = K ab n a V», (665) 
R- K 2 + K tlv K* J,v = -2g ab n a n\ (666) 

where !Z a bcd is the Riemann tensor in the bulk, R^uap is the Rlemann tensor on the 
brane, and we recall that the extrinsic curvature of the brane is given by K^ v = ^C n g^, 
the Lie derivative of the induced metric, with respect to the normal. 

Now, in the RS2 scenario the dynamics of the brane are governed by the bulk equa- 
tions of motion and the Z2 symmetric junction conditions 

6 

Qab = -plab, K^ v - Kj^v = Za^g^ - 4irG 5 T^ u , (667) 

with c* = 47r ^ 5CT , where a is the brane tension and Tp, v is the energy-momentum tensor 
of additional matter excitations. Of course, in the RS2 scenario we have a fine-tuned 
tension such that er* = 1/7, but we will keep things general for the moment. 
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Consider the Codazzi Equation ( 665 1 . Because the bulk is only sourced by a cosmo- 
logical constant, the right hand side of this equation is identically zero. This is important, 
because the junction conditions in Eq. (6671 now imply the usual conservation law along 
the brane, V M 7^j, = 0. To extract information about the Einstein tensor on the brane, 
we contract the Gauss Equation (664), and plug in Eq. (667), to give |1140j 



Gf^ u = — A 4 (^„ + %-kG^T^v + (47rG 5 ) 2 n MI , — E^, 



(668) 



where A 4 = 3 (crj — w) is the effective cosmological cosmological constant on the brane, 
and the effective Newton's constant is given by 



Ga = G5CT* 



(669) 



The corrections to standard 4D gravity are encoded in a local contribution, II M „, and a 
non-local "Weyl" contribution, E^ v . The local piece is a quadratic combination of the 
energy-momentum tensor 



n 



—T a T + -TT 

'(l- 1 l'oT^( I flu 



(670) 



whereas the non-local "Weyl" piece is 



= C abcd n a VyV*, 



(671) 



where C a bcd is the Weyl tensor of the bulk. It is often referred to as the electric part 
of the Weyl tensor, and in general one must solve the bulk equations of motion first in 
order to evaluate it in full. We should also note that E^ has vanishing trace, and its 
divergence is sourced by the local quadratic contribution 



g^E. v = 0, 



V M ££ = (47tG 5 ) 2 V m IE;. 



(672) 



The latter equation follows from the divergence of Equation ( 668 1 , making use of the 
Bianchi identity and the local conservation of energy-momentum, V^T^ V = 0. 

Our interest here lies in the cosmology, so let us now assume spatial homogeneity and 
isotropy on the brane. The induced metric is given by the usual Friedmann-Lemaitre- 
Robertson- Walker (FLRW) metric 



ds 2 = g^d^dC = -dt 2 + a(tf 'q^dx' dx 3 , 



(673) 



where qij(x) is the metric of a hyper-surface of constant curvature, n — 0, ±1. The matter 
excitations contribute a cosmological fluid with energy density, p(t), and pressure, P(t), 
such that its energy-momentum tensor is given by 

Tf = diag(- p(t),P(t),P(t),P(t)). (674) 

In the absence of non-trivial sources in the bulk is conserved, which means we have 
the standard relation given by Equation ( 658 ) . 

It remains to compute the non-local piece, E^ v . Although in general we must solve the 
bulk equations of motion to evaluate this, we can exploit the large amount of symmetry 
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to avoid doing so in the current instance. Making use of the Constraints Equations ( 672 1 
we can show that 

where /x is an integration constant that should be fixed by the bulk geometry (we will 
see in the next section that it can be identified with the mass of a bulk black hole) . Note 
that is conserved on a cosmological background, by virtue of the fact that V M II^ = 0. 



Equation (668). 



The modified Friedmann Equations (656) and (657) now follow automatically from 



5.4-2. Bulk based formalism - moving branes in a static bulk 

The principle limitation of the brane based formalism we have just described is that 
it suppresses physics deep inside the bulk. This can be dangerous since it is not at all 
obvious if a particular choice of the non-local "Weyl" term, E^, at the brane will evolve 
into a pathological bulk geometry. Such problems can be avoided by solving for the bulk 
geometry first. For example, in the brane based formalism, the parameter [i giving rise 
to dark radiation, is just an integration constant and can take any sign. Using the bulk 
based formalism, we are able to identify /i with the mass of a bulk black hole, and use 
this to constrain it to be positive. 

We will now develop the bulk based formalism for cosmological branes. Again, we 
will assume Z 2 symmetry across the branes, for brevity. The reader can refer to |1008] 
for the non-Z2 symmetric generalisation. The bulk based formalism requires us to solve 
for the bulk geometry. Since we are interested in cosmological branes (with constant 
curvature Euclidean 3-spaces), we study the Einstein equations, Gab — j2^ab, with the 
following metric ansatz |178j 

ds 2 = lab dx a dx b = e 2l, A~ 2/3 {-dt 2 + dz 2 ) + A 2 / 3 q lj dx i dx j , (675) 

where A and v are undetermined functions of t and z, and as before Qij(x) is the metric 
of a hyper-surface of constant curvature, n — 0, ±1. Now, in an extremely elegant 
calculation, Bowcock et al. [178] were able to prove a generalised form of Birkhoff's 
theorem, showing that the bulk geometry is necessarily given by 

dr 2 

ds 2 = -V(r)dT 2 + — - T + rtqijdx'dx 3 , (676) 



V(r) 



where 



For [i > 0, the metric in Eq. (676) takes the form of a (topological) Schwarzschild black 
hole in anti-de Sitter space. Here we have written the solution in an explicitly time 
independent coordinate system, meaning that we can no longer say that we have a static 
brane sitting at a fixed coordinate position. On the contrary, we now have a dynamic 
brane, whose trajectory in the these coordinates is more complicated. Braneworld cos- 
mology from this perspective was first studied by Ida [639J, although moving branes in 
a static anti-de Sitter bulk were considered earlier by Kraus [754] . 
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To construct the brane solution, we treat it as an embedding, 

r = t(<), r = a(t), 



(678) 



of the bulk geometry given in Eq. (676). The induced metric on the brane is then 

A2 



ds z = g^d^dC 



-V{a)t z 



V{a) 



dt + a {t)qijdx l dx :> 



(679) 



where over-dots denote d/dt. We are free to choose t to correspond to the proper time 
with respect to an observer comoving with the brane. This imposes the condition 



V(a)t 2 



V{a) 



= -1, 



ensuring that the brane takes the standard FLRW form, as in Eq. ( 673 1 . The function 
a(t) is then immediately identified with the scale factor along the brane. 

The boundary condition at the brane are given in Eq. (667). We must compute the 
extrinsic curvature, K^ v = \C n g tlUl defined as the Lie derivative of the normal pointing 
into the bulk. Assuming we cut away the AdS boundary and retain the region r < a(t), 
we find that the inward pointing unit normal is given by 



n a = (-a, t, 0,0,0), 



(681) 



where we are free to specify that f > 0. The components of extrinsic curvature are then 
given by 



Vt ■ 
K l = —8 l - 



K* = - 



a + V/2 
Vt 



(682) 



in Eq. (667) yield the following: 



In the presence of a cosmological fluid, as given by Equation ( 674 ) , the junction conditions 

(683) 
(684) 



a a 



\V> 



Vt 



p 

1- 2- -3— 
a a 



Making use of Equation ( 680 ) we then arrive at the modified Fricdmann Equations ( 656 ) 
and (657). 



5.4-3. Cosmological perturbations 

While the theory describing cosmological perturbations in braneworld gravity has 
been well developed in recent years (see, for example, [8421 17711 18471 13341 110871 1991118011 

[27121127m imsi iTTaiMTiEmn MRimimKmiwmm^MmwmimRrmmRwm 

17691 15971 147)81 ITMl 14301 [9551 [2861 [3001 [3951 IT0621 [7871 [245] ) , even approximate solutions 
to the resulting field equations have been notoriously hard to come by. The problem 
stems from the fact that one has to solve the fully coupled system of brane and bulk, 
which is, in general, a far from trivial task. Indeed, this will generically render the brane 
based formalism somewhat incomplete without making ad hoc assumptions about the 
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perturbations of the non-local "Weyl" contribution, E^ u . The bulk based formalism is 
better in this respect, but the resulting system is virtually intractable, and only in some 
special cases, where the bulk and brane equations become separable, has progress been 
made 528, 596, 12021 12011 1772") . Note that, unlike in other sections, we will on occasion 
include vector and tensor perturbations, as well as scalars. This is because the bulk can 
source vector and tensor modes on the brane, giving qualitatively different behaviour to 
that seen in standard AD cosmology. 



Cosmological perturbations in the brane based formalism 



We shall now review some aspects of cosmological perturbation theory using the brane 



based formalism introduced in Section 5.4.1 Further details can be found in [8451 1842] . 
The dynamics on the brane are governed by Equation ( 668 ) , and so perturbations about 
the background cosmology on the brane satisfy 



<5G£ = 87rG 4 <ST£ + (47rG 5 ) 2 (5n(; - 8E%. 



(685) 



To study this we use the standard four-dimensional formalism [901 17171 14691 1917j , decom- 
posing the system into scalar, vector and tensor perturbations with respect to the spatial 
diffcomorphism group in the background cosmology. Working with conformal time as 
opposed to proper time, the perturbed metric is given by 



ds 1 



-(1 + 2*)dr 2 - 2(fa + V.fadx'dr 



(1 - 2$)^ + D i3 -v + 2V (i A 



dx l dx J 



where Ai and fa are transverse vectors on qij, and hij is a transverse and trace-free tensor. 
Recall that the operator = V j Vj — A. The fluctuations energy-momentum on the 
brane are written in terms of the fluctuations in density, 6, pressure, SP, fluid 3- velocity, 
Vi, and anisotropic stress, in the usual way, 



5%° = - P S, 



ST? 



-(p + P) Vi , 67] = SP5) + (p + P)Ej 



(687) 



The fluid 3-velocity and anisotropic stress can then be decomposed with respect to their 
scalar, vector and tensor components, 



•j(vcctor) 

u 

r-,(tcnsor) 



Vi6, 



(vector) 



The fluctuation in the quadratic piece is also given in terms of pS, SP . . . etc., according 
to 



SU° = 2 - P 5%\ <5n° = ^pST^ 

sn) = -\(p + zpw; -\{p + p)5) (V ° - \st^ 



(690) 



where 8Tjf are given by Equations (687). 

201 



Now we consider the contribution from the non-local "Weyl" perturbation, and iden- 
tify it with the fluctuation in some dark energy energy-momentum tensor 



SE U 



In direct analogy with Equations (687), we can read off corresponding fluctuations in 
Weyl energy density, <5 woyl , Weyl pressure, SP wcyl , Weyl fluid 3- velocity, v™ cyl , and Weyl 
anisotropic stress, S^ eyl . We might hope to determine each of these in terms of the local 



matter fluctuations, 5, SP . . . etc., by making use of the Constraints Eqs. (|672|) to linear 
order. We find that for scalar perturbations SEft = givea^ 



Jpwcyl _ l^woyljwcyl 

3 



and V M [6Eg - (47rG 5 ) 2 U>] = gives 



(jwcyly + 



-A 



nwcyl 



= 0, 



weyl 



o Xwcyl 

(tfweyly + $ _ _(A + 3K)E woyl — 



p + P 



{ P 5 + { P + p)\me - (A + 3k)s]} . 



(691) 



(692) 



(693) 



where ' denotes differentiation with respect to conformal time, and H = a' /a = aH. 
Clearly we have too many Weyl unknowns and not enough equations. The bottom line is 
that we need to know the Weyl anisotropic stress STr™ cyl explicitly, and for that we need 
to abandon the brane based formalism and solve the bulk equations of motion [842 . 

Some progress can be made at super-horizon scales, since then we can neglect the 
spatial gradients in Eqs. (692 1 and (693), and solve for the Weyl energy density and 
Weyl momentum in terms of pS, SP . . . etc., thereby closing the system |842[ 15481 1771j . 
This simplification has been applied to the study of both density perturbations and 
vector perturbations at large scales, revealing qualitatively different behaviour to that 
in General Relativity [548, 844 . Regarding density perturbations, it can be shown that 
the quantity pa 4 Ac) will grow during slow- roll inflation on super-horizon scales (it stays 
constant in GR) |548j . It can be shown that vector perturbations can be non- vanishing, 
even in the absence of vorticity [844 . 

We can also solve for the (total) curvature perturbation on large scales |771j . Unfor- 
tunately, this does not mean we can compute large-scale CMB anisotropics. The problem 
is that to evaluate the (non-integrated) Sachs- Wolfe equation we need knowledge of the 
metric perturbations. These are sourced by the dark anisotropic stress, according to 



$ - # = 87rG 4 a 2 



pWcyl^ £\vcyl 



(694) 



49 The remaining formulae in this section are taken from |771| . where one should identify 8p = pi, 5q ■ 
-(p + P)6, Sn = (p + P)S, Ti = A = E=\v, B = -/3. 
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where and $ are the Bardeen gauge invariants for the metric perturbations, and we 
have neglected the local anisotropic stress. The braneworld corrections to the Sachs- 
Wolfe effect are given by |771j 



ST _ ST 
~T ~ ~T 



8 ( P-t 

GR 3 \Pcdm 



& 



wcyl 



- 8nG 4 a 2 ^p wcyl ) S^ 1 + J da a 7 ' 2 Qp wcyl ) £ wcy \ (695) 

where <S> woy i is the Weyl entropy perturbation, determined by pS weyl . 
Cosmological perturbations in the bulk based formalism 

As we saw in the previous section, a proper treatment of cosmological perturbation 
theory in brane cosmology requires us to solve the coupled system of brane and bulk. 
We will now present the details of this in the bulk based formalism, essentially following 
[7TB] (see also P^Ml9T51[gT9l[g2Tl[^n] ). Note that in this section we deviate from our 
usual convention of treating cosmological perturbations with respect to conformal time 
on the brane, preferring instead to use proper time of comoving observers, in keeping 
with the majority of the relevant literature. For consistency, however, we do define the 
brane quantities as in the previous section. 

We begin with some notation on the background. The bulk metric is given by 



ds 2 = r y a f l dx a dx b = X a pdx a dx 



dx l dx 3 , 



(696) 



where X a p is some two-dimensional metric, and A Q( g and r depend only on the first two 
coordinates, x a . This corresponds to a section of (topological) AdS-Schwarzschild, and 
one can choose a gauge such that 



\ a pdx a dx = ~V(r)dT 2 + 



dr 2 

WY 



V(r) 



(697) 



However, here we will leave the choice of gauge unspecified so that our analysis can 
be applied in other gauges (e.g. Gaussian-Normal gauge). The background embedding 
equation is now given by x a = A a (£ M ), where £ p = (t,£ z ) are the coordinates along the 
brane. We take 

X a =X a (t), X l =C; 

and \ a /3(X)X a XP = — 1 to en sure that the induced metric on the brane has the standard 
FLRW form, as in Eq. (6731, with scale factor a(t) = r(X). Over-dots here denote 
differentiation with respect to proper time on the brane, t. 

We shall now specify the perturbations in the bulk. Decomposing the bulk metric in 
terms of scalar, vectors and tensor components (with respect to Qij), we write 5j a b = 



u scalar i u vector 
n ab + n ab 



ft* e 6 nsor , where, 
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D i3 E] , 



(698) 
(699) 
(700) 
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where = VjVj — gQ^'A. Here B ai and are transverse, and is transverse and 
trace-free. One can identify the following gauge invariants in the bulk [716) : 

scalars: Y afj = Xa p ~ 2D {a Q f3): Z = A - - AE - Q a d a In r, (701) 



vectors: F ai = B ai — rD a Hi, 
tensors: Ha, 



(702) 
(703) 



where D a is the covariant derivative on \ a p, and Q a = r(\a — rd a E). For simplicity 
we will assume that there are no exceptional modes [716] in any sector. Now, the bulk 
equations of motion, 5G% — 0, are extremely complex in general (see 716 ). From a 
practical perspective, the only way to proceed is to assume that the background bulk is 
maximally symmetric anti de Sitter space, so there is no bulk black hole (i.e. ji = 0) 
|918j . The vector and scalar perturbations in the bulk can then be expressed in terms of 
a corresponding 'master variable'. For example, the scalar gauge invariants Y a p and Z 
can be written as 



Y a0 = 



1 



D a D/3 — 



D 2 — 



1 

2T 2 



n, Z = --Y = — 
2 6r 



D 2 - 



where the scalar master variable satisfies 



D 2 ~ 3<9 Q ln rD a 



A + 3k 



f) = 0. 



n, (704) 



(705) 



For the vectors, we introduce the vector master variable, writing the gauge invariant as 

1 



F„ 



(706) 



where e Q ^ is the total antisymmetric Levi-Civita tensor on A Q ^, and f2i satisfies 



D a 



r 5 D, 



= 0. 



(707) 



The tensor gauge invariant, H^, is essentially its own master variable, and satisfies 



D 2 + 3D a In rD a + (A - 2k) 



= 0. 



(708) 



To solve for these master variables we need to specify boundary conditions at the branc, 
as well as the asymptotic boundary conditions. The latter correspond to the condition 
of normalisability. Let us now consider the boundary conditions at the brane. The first 
thing to note here is that the brane position can fluctuate, so that it now corresponds 
to an embedding x a = X a + / cs (£ Al ). We decompose this fluctuation in terms of scalars 
f a , e, and a transverse vector e\ such that f a — (f a ,e l + V 4 e). Secondly, in applying the 
boundary conditions, it is important to express all fields in terms of covariant objects 
along the (background) brane. This suggests that all fields of the form W a and W a/3 
should be decomposed in terms of their components parallel to the tangent vector X a 
and along the normal n a , as follows: 



W a = -W\\X a 



W ± n a , (709) 

VF||||X Q 1 /3 - W\i X X a n - W x \\X^n a + Wxxn a n p . (710) 
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The induced metric on the brane then takes the form 



ds 1 



with 



-(1 + 2^)dt 2 - 2a(ft + V^dx'dt 



+ a* 



(1 - 2$)qij + DijV + 2V(i-Aj-) + h i3 dx l dx 3 ', (711) 



-Z - ^Ais + HU\\ - (D± \nr)U ± , 



a 

v = 2{E + e) 
A,, = Hi + a 



av 



(712) 

(713) 

(714) 
(715) 



where U a = Q a + f a . Note that U a , v and Ai are invariant under gauge transformations 
in the bulk, so these expressions have been written entirely in terms of bulk gauge 
invariants. 

The gauge invariants on the brane are the ones familiar to us from standard cos- 
mological perturbation theory in four dimensions. Given the fluctuation in the brane 
energy-momentum tensor 



57? = -p5, 57? = -a{p + P)v u 57] = 5P5^ + (p + P)E}, 



(716) 



and Equations ( 688 ) and ( 689 1 , we note that the components of the anisotropic stress 
E, vj(vcctor) an( j ^(tensor) are ^ gauge invariant on the brane, along with the following 
scalars, 



$ = $ + ^Av + aa g = -Z - (D x In r)Ux, 
* = H>-(av g J=~Y M +KiUx, 

6 = 6-<Tg, 

5 = 5 + 3Ha ( 1 + - 

V p 

r = p5 — c 2 s 5P, 
the vectors 0, and (3i = Pi + aAi = —F\\i, and the tensor, Note that 



(717) 

(718) 
(719) 
(720) 
(721) 



El 

a 



The goal is now to solve for the master variables, subject to boundary conditions set 
by normalisability and the values of the non-dynamical gauge invariants on the brane, 
namely, T, E, vj( vcctor ) and £g™»>. We 

can then use this knowledge to derive the dy- 
namical gauge invariants, $i,hij,9,5, and 0$. We are already able to express the 
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metric invariants in terms of the master variables (and U±) as follows: 



$ = 



1 



HQ - (L> ± \nr)Dj_n + 
-(£>x lnr)C/x, 



P 



(A + 3k) 
3^2 



1 

2a 



- 2HQ + (2L>x lnr - Kf) Dj_n - 



1 2(A + 3k) 



3a 2 







Pi 



(722) 
(723) 
(724) 



To fix U±, and the remaining dynamical gauge invariants, we need to impose boundary 
conditions at the brane, given by the linearised Israel junction conditions 8(Kjf — K6£) — 
— 47rG 5 <57^. For the scalars this gives 



(A + 3k)Dx [ — ) = 87rG 5 a 2 pS - (A + 3k)[(p + P)E] 



U± 



8ttGci 2 



{p + P)6-d t [a{p + P)Y] 



47rG 5 a 2 (p + P)S, 



-r = c 2 sP 5 + ( P + P)(^- d t (ad) + | ( A + 3k)S 



(725) 

(726) 
(727) 
(728) 



We should now view these equations as follows |716j : Equations (725) and (726) fix S 



and 9 give n kn owledge of the anisotropic stress, E and the master variable, f2, while 
Equation (727) fixes U±. One can then substitute the expression for 5 and 9 (in 



terms of £1 and S) into Equation (728) to derive a Neumann type boundary condition 



on ft. In principle, one should now be able to solve the Master Equation (705), with a 



suitable boundary condition derived from Eq. (728), and feed the solution back into the 
expressions for Vl/, 9 and 6. 



For vectors, the Israel junction conditions yield 

(A + 2k)0, ; = 8nG 5 a A (p + P)9i, 

(li = 47rG 5 a 3 (p + P)E l (vcctor) . 



(729) 
(730) 



This time we view Equation ( 730 ) as a Dirichl et ty pe boundary condition on the master 
variable. We then solve for VLi using Equation (707), and subsequently fix /3j and 9i using 



Equations (724) and (7291 



Finally, we consider the tensor. Here we solve Equation ( 708 1, subject to the following 
junction condition at the brane: 



^x^=-47rG 5 ( (0 + F)^ cnsOT) . 



(731) 



We have now presented the formalism in full, but the task of solving the system 
explicitly is another matter altogether. To do so one must choose coordinates for the 
background bulk. Whilst the static coordinate system might seem the simplest from 
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the bulk perspective, it is rarely used owing to the fact that a Gaussian Normal (GN) 
coordinate system makes it much easier to specify the boundary conditions. For example, 
when = and k = 0, the metric in GN coordinates is given by [158] 



X a pdx a dx p = dz 2 - N 2 {t, z)dt 2 



r(t, z) = a(t) 



cosh (|* | /J) 



a J 



sinhOI/0 



(732) 
(733) 



where N = r i t /^} ■ Even in this case analytic solutions can only be obtained in the case 



a(t) 



of a background de Sitter brane, when the Master Equation ( 705 1 becomes separable 



|202l 1772] . Numerical solutions, however, have been obtained for scalar perturbations on 
a radiation dominated brane |245j . It was found that short wavelength density pertur- 
bations are amplified relative to their value GR during horizon reentry, but not so much 
that they cause an observable effect in the CMB or in large-scale structure. 



5.5. Dvali-Gabadadze-Porrati Gravity 

The most celebrated braneworld model exhibiting guasi-localisation and an infra-red 
modification of gravity is without doubt the DGP model, developed by Dvali, Gabadadze 
The model admits two distinct sectors: The normal branch and the 



and Porrati [454] 

self- accelerating branch. The latter has generated plenty of interest since it gives rise 
to cosmic acceleration without the need for dark energy |394l 1398] . However, as we will 
see, fluctuations about the self-accelerating vacuum suffer from ghost-like instabilities 
|2731 110151 559 , 546J . Although the normal branch is less interesting phenomenologically, 
it is fundamentally more healthy and is the closest thing we have to a consistent non-linear 
completion of massive gravity. Here the graviton is a resonance of finite width, l/r c , as 
opposed to a massive field. At short distances, r <C r c , the brane dynamics do not feel 
the width of the resonance, and the theory resembles 4D GR. At large distances, r r c , 
however, the theory becomes five dimensional as the resonance effectively decays into 
continuum Kaluza-Klcin modes. It is claimed that the Vainshtein mechanism works well 
on the normal branch of DGP, screening the longitudinal graviton without introducing 
any new pathological modes, in contrast to massive gravity [4051099]. The breakdown 
of classical perturbation theory at the Vainshtein scale can be linked to quantum fluctu- 
ations on the vacuum becoming strongly coupled at around 1000 km |839l 110731 1447j . 



5.5.1. Action, equations of motion, and vacua 

The DGP model contains a single 3-brane embedded in an otherwise empty five 
dimensional bulk space-time. Generalisations that include a bulk cosmological constant 
and/or bulk branes have been studied |697l 11011] . However, the original DGP action is 
given by 



S = Ml 



d 5 o 



M 



XyJ—g 



dM 



M 2 
5 2 



(734) 



where, by Z2 symmetry across the brane, we identify the entire bulk space-time with two 
identical copies of M. , and the brane with the common boundary, dAi . The bulk metric 
is given by j a i,, with corresponding Ricci scalar, 1Z, and M 5 is the Planck scale in the 
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bulk. The induced metric on the brane is given by g„v, and K = g^K^v is the trace of 
extrinsic curvature, K^ v — ^C n g^ u . Here we define the unit normal to point into M.. 

The brane has a bare vacuum energy, or tension, <j, and additional matter contri- 
butions contained in C ma tter- However, the key feature in the brane Lagrangian is the 
induced curvature term, given by M 2 R. Such a term can be generated by matter loop 
corrections [161] . or finite width effects |263j . The mass scale, M4, is taken to be Planck- 
ian, ~ 10 18 GeV. There is a hierarchy between this scale and the bulk Planck scale, 
A/4 S> M5, which has proven difficult to derive from fundamental theory. Nevertheless, 
the hierarchy enables us to identify a crossover scale, r c ~ M%/2M§, below which the 
theory looks four dimensional, and above which it looks five dimensional. 

The bulk equations of motion are given by the vacuum Einstein equations, 

Gab = Tlab - -^lab = 0, (735) 
and the boundary conditions at the brane are given by the Israel junction conditions, 

2M 2 (K^ U - K 9fiv ) = M%G MV + a 9fW - T^, (736) 

S_ r 

-g 5gt"< JdM ' 



where % v = -^=5^7 J dM d^x^gC^ttcr is the energy-momentum tensor for the 
additional matter. 

Let us now consider the vacua of this theory, corresponding to maximally symmetric 
brane solutions with T^u = 0. For a given tension, there exist two distinct vacua [3941 



398 . Assuming a > for definiteness, one finds that these correspond to de Sitter branes 
with intrinsic curvature, 

Ml [ / Mia] 
H -Ml [ £+ V 1+ 6M|J' (737) 

where e — ±1. In conformal coordinates, the full solution can be written as 

ds 2 = % b dx a dx b = e 2eHv (dy 2 + g^dx^dx"), (738) 

where g^dx^dx" = —dt 2 + e 2Ht dx 2 is the 4D de Sitter line-element written in Poincarc 
coordinates. The domain M. corresponds to < y < 00, while the brane is located on 
the boundary at y = 0. 

The two branches of this solution are often referred to as the normal branch (e = —1), 
and the self-accelerating branch (e — 1). The latter is so-called because as we take the 
limit of vanishing vacuum energy, a —¥ 0, the metric on the brane is still asymptotically de 
Sitter. The limiting de Sitter curvature is given by the cross-over scale, H = 2Mf/Mf ~ 
l/r c . If this were to account for dark energy today, the crossover from four to five 
dimensions would have to occur at the horizon scale, placing the fundamental Planck 
scale at M 5 ~ 10 MeV or so. However, as we will discuss in Section |5.5.3| this branch 
of solution is unstable as it contains physical ghost excitations. The normal branch 
is asymptotically Minkowski in the limit a — > 0, and does not suffer from a ghostly 
pathology. On this branch it is phenomenologically more interesting to have the cross- 
over scale at distances below the horizon scale. 

The distinction between branches is best understood by considering their embedding 
in the 5D Minkowski bulk. In each case, the brane can be viewed as a AD hyperboloid 
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Figure 11: Embedding of a de Sitter brane in a flat 5D bulk, taken from 273, 559 . The braneworld 
volume is the hyperboloid in the Minkowski bulk. The normal branch corresponds to keeping the interior 
of the hyperboloid, while the self-accelerating branch corresponds to the exterior. 



of radius 1/H (see Figure 111. The choice of sign, e = ±1, corresponds to the choice 
in whether one retains the interior (e = —1) or the exterior (e = 1) of the hyperboloid. 
Note that retaining the exterior ensures that the brane behaves like a domain wall with 
negative effective tension, even when a > [559] . This is an early warning sign that the 
self-accelerating branch could be pathological. 

5.5.2. Linear perturbations on the normal branch 

We now consider linear perturbations on the normal branch. For simplicity and 
brevity, let us consider the limiting case of vanishing vacuum energy, a — 0, corresponding 
to fluctuations about a Minkowski brane in a 51? Minkowski bulk, = rj ab + h ab . It is 
convenient to choose harmonic gauge, 

daK = \d b h a a , (739) 

so that the linearised bulk equations of motion take the simple form 

[d 2 + d 2 ]h ab = 0, (740) 

where d 2 = d^d^. Taking Fourier transforms with respect to the brane coordinates, so 
that 

Q(x,y) -> Q(p,y) = J d 4 xe^ Q(x,y), 

it can be seen that the normalisable solution is given by 

h ab (p,y) = e- py h ab {p,0). (741) 
209 



We have not yet fully fixed the gauge as there is still a residual symmetry corresponding to 
gauge transformations of the form x a — > x a + £ a , where (d 2 + <9 2 )£ a = 0, or, equivalently, 
£ a (p,y) = e~ pv t; a (p 7 0). We now use £_ v to fix the brane position to lie at y = 0, and £ M 
to set hfj, y — 0. It follows from the y component of Equation (739) that h yy = h, where 



h = ri^h^. The v component of Eq. (7391 then gives d^h^ = d v h. 
The junction conditions at the brane, 



2Ml5{K tiV - K 9fiU ) = MtSG^ 



Tim, 



now yield the following solution for the metric on the brane: 

2 



3 ^v ,u 



Mlp 2 + 2Mgp 
At low energies, p -C l/r c = 2M^/M 2 , this becomes 



pure gauge pieces. 



V/(P, 0) 



Tu,i> 



(742) 



(743) 



which suggests that the potential for a point mass will scale like V(r) oc 1/r 2 at large 
distances, r ^> r c . This is consistent with 5D gravity, and is to be expected for an 
infinitely large 5th dimension. In contrast, at high energies, p 3> l/r c , we have 



h^{p, 0) 



M 4 V 



(744) 



This can be compared with the propagator in four-dimensional General Relativity, = 

T^iv — \Tr\yuv ■ Thus we almost have agreement. Indeed, since the scaling with 

momenta is the same, so the potential for a point mass will go like V(r) oc 1/r for r <C r c . 
We can understand this as follows: The graviton mediating the interaction between two 
point particles along the brane is bound close to the brane by the induced curvature 
there. It is only at very large distances, when the induced curvature is insignificant, that 
the graviton is able to probe the extra dimension. 

However, at this stage it would be premature to say that we have localisation of 
gravity at short distances. The problem lies with the tensor structure of the propagator. 
As we can see, the GR result differs from DGP in that it has a factor of \ as opposed 
to |. This is due to the longitudinal mode of the graviton propagating in DGP but not 
in GR, and will inevitably lead to disagreement with observation. To see this, consider 
the amplitude for the interaction between two sources, and T'^ v , which is given by 
A = h veu T' tlv . In GR, we have 



A 



GR 



T»MT'^{~p)-\t{p)t'{-p) 



whereas in DGP, at high energies, we have 



A 



DGP 



M\p 2 



1^ 



T^T'^i-p) - -T(p)T'(-p) 
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(745) 



(746) 



To get the same potential for the interaction of two non-relativistic sources, we require 
thaip°| M| = §-M^. Now consider what happens when one of the sources is relativistic, 
say TL V . We then have T" = 0, and so the amplitudes differ by a factor of |. So, although 
we can recover the standard Newtonian potential, the prediction for light bending around 
the Sun differs from GR by a factor of f . 

This is reminiscent of the vDVZ discontinuity in massive gravity [12431 [1296 . In 
fact, it is sometimes said that DGP gravity (on the normal branch) is the closest thing 
we have to a non-linear completion of massive gravity. Strictly speaking, the behaviour 
is more like a massive resonance of width l/r c , with decay into a continuum of massive 
modes occurring at r > r c . However, the similarities with massive gravity do suggest a 
possible resolution to this issue of tensor structure, and the resulting phenomenological 
problems. In massive gravity, it has been argued that the linearised analysis around a 
heavy source breaks down at the so-called Vainshtein radius, which can be much larger 
than the Schwarzschild radius [1241] . Below the Vainshtein radius, it is claimed that 
non-linear effects help to screen the longitudinal scalar and match the theory to GR 
without a vDVZ discontinuity. Similar claims have been made in DGP gravity [399], as 
we will discuss in Section f5. 5. 41 

5.5.3. Linear perturbations (and ghosts) on the self- accelerating branch 

Let us consider linearised perturbations and ghosts about the self-accelerating vacuum 



described in Section [5T5T] (see also [2731 \EM \SM HSU)- To reveal the ghost in 

the cleanest way possible, it is convenient to retain non-zero tension on the background. 
It is also enough to consider vacuum fluctuations, so we will set T^u = for brevity (a 
more complete discussion including matter fluctuations can be found in [273j ~1 . 

Recall that the self-accelerating background solution is given by the geometry in Eq. 



(7381, with e = +1 and the brane positioned at y = 0. A generic perturbation can be 
described by j ao — */ a b + 5 r ) a b 1 with the brane position shifted to y = F(x). We will work 
in Gaussian Normal (GN) coordinates, so that 

hyy = = 0, and 6^ v = e Hy ' 2 hfj, v (x,y). (747) 

The tensor can be decomposed in terms of the irreducible representations of the AD 
de Sitter diffcomorphism group as 



1 _ , h 



V = Ki + V ^ A v + + V M V„<£ - -g^U<t) + -g^, (748) 

where V is the covariant derivative associated with the AD de Sitter metric, g^ v . The 
tensor hT£ is transverse and trace-free, V^hF 1 - p „ = g^hTJ = 0, and is transverse, 
V^A^ = 0. In addition, we have two scalars, cj) and h = g^h^. 

Following a similar approach to the one outlined for Randall-Sundrum gravity in 
Section 5.3.5[ we can now fix the position of the brane to be at y = while remaining in 



GN gauge by making the following gauge transformation: 

e -Hy 

y^y- Fe~ Hy , and x» -> V^F. (749) 

H 



50 To verify this result, simply insert the appropriate energy-momentum tensors for non-relativistic 
rces, T^ v = diag(p 
match the two results. 



sources, = diag(p, 0, 0, 0), and = diag(p' , 0, 0, 0), in each expression for the amplitude, and 
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Although the brane position is now fixed at y = 0, the original brane position F{x) 
still enters the c 
perturbation as 



still enters the dynamics through a book-keeping term, h^v , that modifies the metric 



V -> h l» + + v ^ + V p V„</> - |.9^D</> + + hfj . (750) 

The book-keeping term is pure gauge in the bulk, and is given by 

h^ = ^e H ^{V^ v + H% v )F. (751) 

We can now substitute our modified expression for into the linearised fields equations 
in the bulk, SQ a b = 0, and on the brane, 2M§S(K tlv — Kg^ v ) = M 2 SG^. It turns out 
that the transverse vector, A^, is a free field in the linearised theory and can be set to 
zero. In addition, the yy and yfj, equations in the bulk imply that one can consistently 
choose a gauge for which 

h = 0, and (□ + AH 2 )<j> = 0. (752) 

Note that we now have h^ u = + hffi + hffi , where the contribution from (f>(x, y) has 
been rewritten as 

h$ = (V„V„ + H 2 g^) 4>{x, y). (753) 

This mode is now entirely transverse and trace- free in its own right. In the absence of 
any additional matter on the brane, T^ v = 0, the same is true of the book-keeping mode, 
hffi . This is because the trace of the Israel equation now implies that 

(□+4H 2 )F = 0. (754) 

The entire perturbation h /J/V (x,y) is therefore now completely transverse and trace-free. 
This greatly simplifies the bulk and brane equations of motion, giving 



□ -2^ + ^- 9 | 



M 2 (D -2H 2 ) + 2M| (d y - 



h ltu (x,y) = fory>0, (755) 

V = ° aty = 0+. (756) 



3H 



Variables in the tensor and scalar fields can now be separated as follows: 

h T J(x,y) =J2 u ™(yH™ ) ( x )> <K x >v) = W(y)Hx), (757) 

m 

where xffl IS 4-D tensor field of mass m satisfying (□ — 2H 2 )\ffi = rr?xffi ■ Note that 
4> is a 4Z) tachyonic field satisfying (□ + 4H 2 )(f> = 0. This is a mild instability, related to 
the repulsive nature of inflating domain walls. 

Assuming that the tensor and scalar equations of motion can be treated indepen- 
dently, we find that there is a continuum of normalisable tensor modes with mass 
to 2 > 9iJ 2 /4. In addition, there is also a discrete tensor mode with mass 



(758) 



and normalisable wave-function u md {y) = ct md e~ v v ™ d . Now, for positive brane 
tension, a > 0, one can easily check that < m 2 d < 2H 2 . For massive gravitons 
propagating in AD de Sitter, it is well known that masses lying in this range result 
in the graviton containing a helicity-0 ghost [605] . The lightest tensor mode in this case 
is therefore perturbatively unstable. For negative brane tension we find m 2 , > 2H 2 , and 
there is no helicity-0 ghost in the lightest tensor. 

Now consider the scalar equations of motion. The first thing to note is that hffi 
behaves like a transverse trace-free mode with mass m| = 2H 2 , because (\3 — 2H 2 )h\fJ = 

2H 2 tijfi! . Since none of the tensor modes have this mass, they are all orthogonal to h*jfj . 
This means it was consistent to assume that the scalar and tensor equations of motion 
could indeed be treated independently. It turns out that the scalar has a normalisable 
wave- function W(y) = e~ Hy / 2 , and the 4D scalar <fi is sourced by F by the relation 

4>{x) — aF(x), a = — 

This is well defined for a ^ 0, as in this case H ^ l/r c . h)fj{x,y) may now be thought 
of as a genuine radion mode, measuring the physical motion of the brane with respect to 
infinity. It does not decouple even though we only have a single brane. This property is 
related to the fact that the background warp factor, e 2Hy , grows as we move deeper into 
the bulk. 

We have now identified the helicity-0 mode of the lightest tensor as a ghost when 
a > 0. When a < 0, a calculation of the 4_D effective action reveals the ghost to be 
the radion (see [2731 11015] ). Given that there is always a ghost for non-zero tension, by 
continuity one may expect this to remain the case when a = 0. 



2H 



H H 



(759) 



The limit of vanishing tension 



To study this limit more closely, let us first ask whether we can trust the above 
solutions in the limit where a — > 0. In this limit H — > l/r c , and the quantity a becomes 
ill-defined. To understand what has gone wrong, note that the mass of the lightest tensor 
has the limit m 2 , 2H 2 . This means that it is no longer orthogonal to the radion, hjjtJ , 
and so we cannot treat the tensor and scalar equations of motion independently. This 
behaviour can be traced back to an additional symmetry that appears in the linearised 
theory in the limit of vanishing brane tension. It is analogous to the "partially massless 
limit" in the theory of a massive graviton propagating in de Sitter space [605 . In that 
theory, the equations of motion are invariant under the following redefinition of the 
graviton field: 

xif H) (x) -+ xlf H) (x) + (V M V„ + H%Mx). (760) 

This field redefinition has the effect of extracting out part of the helicity-0 mode from 

X/j^ H \ an d as a result of the symmetry this mode disappears from the spectrum. In 
DGP gravity this shift must be accompanied by a shift in the scalar field, (j>, 

<i>(x>y) -> <t>(x,y) - a^2H e ~ Hv/2 ^( x ) = 0( x ,y) - lim lam/^^ ™^(^), (761) 

v rr — 



in order to render the overall perturbation, h fJlU (x,y), invariant. These ijj shifts can be 

understood as extracting part of the helicity-0 mode from x^i^^ i an d absorbing it into a 
renormalisation of (f>. The symmetry will have the effect of combining the helicity-0 mode 
and the radion into a single degree of freedom. It is only after fixing this ip symmetry that 
we can treat the scalar and tensor equations of motion independently of each another. 
We could also consider extracting the entire helicity-0 mode and absorbing it into 4>, or 
vice versa. 

Actually, there exists a clever choice of gauge that enables us to take a smooth limit 
as (j — y [546] . One can then readily calculate the 4D effective action |273j . and derive 
the corresponding Hamiltonian [546 . It turns out that the Hamiltonian is unbounded 
from below, signalling a ghost-like instability |546| . This ghost is a combination of the 
radion and helicity-0 mode, and represents the residual scalar degree of freedom left over 
after fixing the aforementioned ip symmetry. 



Ghosts 



We have shown that for any value of the brane tension, perturbations about the self- 



accelerating branch of DGP contain a ghost. As explained in Section 2.1.3 the ghost will 
generate catastrophic instabilities as it couples universally with gravitational strength to 
the Standard Model fields and the remaining gravity modes. The existence of the ghost 
can be trusted as long as we can trust our effective description. It had been argued 
that this breaks down at the Hubble energy scale, casting doubt on the ghost's existence 
at sub-horizon distances |401| . However the analysis of |741j suggests that the cut-off 
for the effective theory is actually at a much higher energy, being the same as that on 
the normal branch. As we will see in the next section, for fluctuations on the vacuum 
this corresponds to A cut _ ff ~ 10 -13 eV. As this is well above the characteristic scale of 
the self-accelerating vacuum, H ~ 10~ 33 eV, the ghost will cause this vacuum to be 
infinitesimally short-lived due to a divergent rate of particle creatiorpj 

It is also worth pointing out that the ghost can manifest itself beyond perturbation 
theory. In the presence of a self-accelerating brane one can accommodate a Schwarzschild 
bulk with negative mass, without introducing any naked singularities. This demonstrates 
that the five-dimensional energy is unbounded from below, as suggested by the perturba- 
tive analysis. Furthermore, standard Euclidean techniques indicate that the spontaneous 
nucleation of self-accelerating branes is unsuppressed. This is problematic even on the 
normal branch, and suggests that self-accelerating branes should be projected out the 
from the theory altogether [559J . 

5.5.4- From strong coupling to the Vainshtein mechanism 

We now return to the normal branch of DGP gravity. In Section [5.5. 2 1 we saw some 
similarities between this theory and massive gravity. Indeed, it happens that the brane 
to brane graviton propagator is given by 

D°Z(p) = D™Zf c (p,p/r c ), (762) 



51 Note that it has been argued that this choice of vacuum state explicitly breaks de Sitter invariance 
1653) . and so one is free to impose a Lorentz non-invariant cut-off in the 3- momentum such that the 
creation rate is no longer divergent. 
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where -D™J^ 1VC (p, m 2 ) is the propagator for AD massive gravity. Thus, just as massive 
gravity suffers from the vDVZ discontinuity 1243, 1296 as m — > 0, so does DGP gravity 
(normal branch) in the limit p <C l/r c . This means the linearised theory is not reduced to 
GR at short distances, but to a four-dimensional scalar-tensor theory. If this description 
can be trusted at solar system scales, then it leads to wildly inaccurate predictions for 
the bending of light around the Sun. 

In massive gravity, it turns out that the linearised theory cannot necessarily be 
trusted at solar system scales. This is because it breaks down at the Vainshtein ra- 
dius, ry asslvc ~ (rs/rra 4 ) 1 / 5 , where r s ~ 3 km is the Schwarzschild radius of the Sun. For 



.massive 



quantum fluctuations in vacuo, the breakdown of classical perturbation theory at : 
translates into a strong coupling scale A cut _ ff ~ (Mpim 4 ) 1 / 5 [64]. Note that the equiv- 
alent scales in four-dimensional General Relativity are simply the Schwarzschild radius 
and the Planck scale. 



Strong coupling in DGP gravity 



The situation in DGP gravity (normal branch) is similar to the case of massive gravity, 
as we will now show. While aspects of the strong coupling problem were first identified 
in [6961 11073] , the derivation of the strong coupling scale for quantum fluctuations on 
the Minkowski brane is most elegantly presented in [839] . Here we review the results 
of this studjP^Jsee also |1017j ). Their analysis involves a computation of the boundary 
effective actior which is found to be 



, x Ml 



r[hn V ,Nn,hyy] = J d 

--N» (^P+ ~) N» + ^-h^tPhyy - -N^d^hyy - ^~ hyy h 



+ 2 / <fx h tH/ V a ' + T int [h^, N^, hyy], (764) 



where — h^ y and d 2 — d^d^. The function Ti nt contains all the higher-order inter- 
action terms to be discussed shortly. For now let us focus on the quadratic term. This 
can be diagonalised by means of the following field redefinition 



N, t = N^+ r c d^, 



h vy = -2r c \\J-d 2 + —] TT 



(765) 



52 Correcting a few typos along the way. 

53 Thc boundary effective action, r [</>], is obtained by integrating the full action, + Sbm[ 

over bulk fields, $, satisfying the boundary condition = <j>, i.e. 



■><&\aM=<t> 
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The effective action can then be written as 



r[V,^,vr] = J d 



4 Ml 



4 



—N" ( yf^d* + I)iV M + 37r( 5 2 



^ (v^ + ^ + WV. ^.4 ( 766 ) 

The next step is to write everything in terms of the canonically normalised fields 



V = ^V, 4 = ^^, tt=^M 4 ;t. (71,7) 



This gives 



r[v,4,7r] = J d A x Ihr [d 2 - ^-J K„ -\h[d 

+ 1 * x {jlS^ + 7EM r * T ) + r ^tvA>4 ( 768 ) 

Turning our attention to the interaction piece, we note that it contains terms that 
schematically take the form 

d 4 xMl^/^P(h^) a (N^) b (h yy ) c , f d i xM*y/^cp 2 (h tw ) d : (769) 



where a + b + c > 3, and d > 3. By writing these in terms of the canonically normalised 
fields one can easily check that the largest interaction comes from the term with a = 
and b + c = 3. Indeed, for \J — d 2 3> l/r c , the interaction term goes like |839U10l7] 

Tint = — — / d i x(d%) 2 d 2 Tt + sub-leading interactions, (770) 

where 

A = (M 4 /r c 2 ) 1/3 . (771) 

This corresponds to the scale at which quantum fluctuations in vacuum become strongly 
coupled. For M 4 ~ M ph and r c ~ 1/H , we have A ~ 10~ 13 eV ~ 1/(1000 km). In 
other words, for scattering processes above 10 -13 eV perturbative quantum field theory 
in a vacuum is no longer well defined. Of course, it is important to realise that we do 
not actually live in a vacuum, and it has been demonstrated that the strong coupling 
scale can be raised on curved backgrounds [962) . At the surface of the earth, the new 
scale corresponds to about 10 -5 eV, indicating the presence of large quantum correc- 
tions to the 7r field at distances below a centimetre. This will not impact much on table 
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top experiments, however, as the rr field is expected to be overwhelmed by the classical 
graviton at this scale, due to Vainshtein screening. 



The n- Lagrangian 



To see the emergence of the new strong coupling scale most succinctly, it is convenient 
to take the limit in which the troublesome tt field decouples from the graviton. This will 
also help us in our discussion of the breakdown of classical perturbation theory around 
heavy sources and the Vainshtein effect. In any event, we take the so-called decoupling 
limit [962] 

M4 

The limiting theory 



and 



A, 



fixed, 



(772) 



so that all the sub-leading interactions in Eq 
should be valid at intermediate scales, 



(770) go to zero. 



max (L4, r s ) «r -C r c , 

where L 4 ~ I/M4 is the Planck length, and r s is the Schwarzschild radius of the source 
(if present). The action goes like 



r 



d A x 



GR 



' £-k\ 



where 



-GR 



r 



Ml 



hd 2 h 

4 



[3lTd 2 TT-r 2 c (dTT) 2 d 2 TT] 



- -h 



(773) 



(774) 



and we have set the free field = 0. Note that Cgr is j us t the standard Einstein-Hilbert 
Lagrangian, expanded to quadratic order about Minkowski space. The modification to 
GR is encoded in C^. This is often referred to as the 7r-Lagrangian and much of the 
interesting phenomenology of DGP gravity, from strong coupling and Vainshtein effects 
on the normal branch to ghosts on the self-accelerating branch, can be studied using this 
Lagrangian. 

In vacuum (7^„ = 0) the 7r-Lagrangian possesses a symmetry tt — > it+a^ x^ + b, where 
This is sometimes referred to as Galilean invariance and is the 
The most general Galilean invariant Lagrangians 



and b are constants 



inspiration for galileon models 963 
and will be studied in detail in Section 14.41 

We now return to the question of the strong coupling scale in the theory described 
by Eq. (774 1 in the presence of a classical source. The first thing to note is that our 



classical background is no longer the vacuum, but the solution n — n c i(x) of the classical 
field equations 

3d 2 n - r 2 c [(d^n^d^Tr) - (d 2 n) 2 ] 



T 



(775) 

Our interest now lies in the quantum fluctuations, tt = tt c i{x) + ip(x), about this solution. 



Ml 



Plugging this into the Lagrangian given in Eq. ( 774 ) we find 
r 



Ml 
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ri(d(pyd*(p] + -<p5T, 



(776) 



3/4 



whore 

Z^(x) = 3< v - 2r 2 c (O^d" - rj^d 2 ) n cl . (777) 

For non-trivial solutions 7r c ;(x) is not constant, and so neither is Z^ v (x). However, 
assuming the background varies slowly (relative to the fluctuations) we can treat as 
approximately constant in a neighbourhood of a point in space-time. Further assuming 
that the eigen- values of Z£(x) are all of similar magnitude, ~ Z(x), we can identify a 
localised strong coupling scale, 

A* (x) ~ Ay/Z(x) > A. (778) 

As we will see shortly, for spherically symmetric solutions inside the Vainshtein radius, 
ry ~ (rgr^) 1 / 3 , we have tt c i ~ y/r s r/r c , and so Z(r) ~ (ry/r) 3 ^ 2 . On the sphere of 
radius r the local strong-coupling scale is 

Mr) ~ A (21) 

It follows that the classical background, 7r c ;, ceases to make sense below a critical length, 
^min, where r m ; n A* (r m - m ) ~ 1. This is because scattering processes would need to exceed 
the local strong coupling scale to probe the structure of the background at r < r mm . 
This short distance cut-off will typically lie well within the Schwarzschild radius of the 
source, and therefore outside of the regime of validity of the 7r-Lagrangian. 

We should also consider the implication of the local strong coupling scale for energetic 
processes taking place on the surface of the Earth. Here the Earth's gravitational field has 
the dominant effect, so we have A* ~ (\/ r c/ r -f/)10~ 5 eV, where r# ~ 1/Hq ~ 10 26 m is 
the current Hubble radius. For r c ~ rjj this means the fluctuations in the 7r field becomes 
strongly coupled at around 1 cm. However it is important to realise that it is only the 
ip self-interactions that grow large at this scale. The direct coupling to the graviton and 
the coupling to matter are negligible, with the latter going like G±/Z{r) ~ 10 _15 G4. 
Although the scalar fluctuations enter a quantum fog at the centimetre scale they do not 
spoil our classical description as this is dominated by the graviton, which is fifteen orders 
of magnitude larger [962 . 

Finally, let us note another important feature of the scalar dynamics: As Poincare 
invariance is broken by the classical background, 7r c ;, the fluctuations no longer have to 
propagate on the light cone. Indeed, one can explicitly show that angular fluctuations 
about spherically solutions will be super- luminal, causing problems for causality [9j 609J. 



The Vainshtein effect in DGP gravity 



Let us now consider the Vainshtein effect directly. As we will see, this is a mechanism 
in which an additional scalar mode is screened at short distances by non-linear interac- 
tions, thereby eliminating the troublesome vDVZ discontinuity. To see how the scalar is 
screened in DGP gravity, we consider the classical solution to the field equations around 
a heavy non-relativistic source, — diag(p(r), 0, 0, 0), assuming spherical symmetry 
for simplicity. If we are to screen the scalar and recover GR at a given scale, the classical 
graviton solution, h)iv , should dominate over the corresponding scalar solution, ■k c \. In 
other words, we should schematically have 
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Since hffi is derived from Eq. ( |773[ ), it is just the standard linearised GR solution given 
by the Newtonian potential, 



(779) 



where r s = 2G4M is the Schwarzschild radius of the source, M — J p(r)dV is its mass, 
and G4 = l/87rMj is the four dimensional Newton's constant. 



In general, the dynamics of the scalar are governed by the Field Equations ( 775 ) . It 



follows that the spherically symmetric solution, 7r c ;(r), around our heavy source satisfies 



2r| 
3r 



3r 2: 



(780) 



where we have integrated once over a sphere centred at the origin and enclosing the entire 
source. Now at large distances we expect the linear term to dominate, giving a solution 



(r) 



3r' 



(781) 



This scales in the same way as the corresponding graviton solution in Eq. ( 779 ) , and so 



we have an 0(1) modification of General Relativity. However, the linearised description 
breaks down once the non-linear piece becomes comparable, or, equivalently, when 



Or? )' 



r c , (Un) 



(782) 



Substituting our solution from Eq. (781), we see that the linearised theory breaks down 
at the Vainshtein radius, 

rv ~ (r s r 2 c ) 1/3 . (783) 
For the Sun we have r® ~ 3 km, and so 



\ 2/3 
^ 10 18 m. 



(784) 



As long as the cross-over scale is not too far inside the cosmological horizon, the Vain- 
shtein effect extends beyond the edges of the solar system, given by the Oort cloud at 
an average of 10 16 m or so. For the earth we have rf ~ 9 mm, and so 



2/3 



lCTm. 



(785) 



In each case, the breakdown of classical perturbation theory has the same origin as the 
strong coupling of quantum fluctuations in vacuum, namely the self-interaction of the 
scalar, it, with scale-dependent coupling. Note that the Vainshtein scale can always be ob- 
tained from the strong coupling scale by trading the Planck length for the Schwarzschild 
radius of the source. 

Now let us ask whether or not the scalar gets screened below the Vainshtein radius. 



For r <C ry, the linearised solution in Eq. (781) is no longer a good approximation 



219 



On the contrary, the non-linear part of Equation ( 780 ) will dominate, so that at short 
distances we have the solution 



(nonlin) 
cZ 



(786) 



This implies that at short distances the scalar correction to the Schwarzschild solution 
goes like |567j 



Given that ry — > oo as r c — > oo, we see that the scalar does appear to get screened in 
this limit, and that one is able to recover GR without any vDVZ discontinuity. 

We can now use the correction to the Newtonian force to place bounds on the cross- 
over scale. At r ~ 5 AU fractional corrections to the Sun's gravitational field should 
be < 10~ 8 |1197j . which implies r c > 10~ 4 r#. Similarly, the corrections to the Earth's 
gravitational field may have an observable effect on the precession of the moon, where 
corrections to the potential go like SV/V\ moon ~ 10 _13 r^/r c |451) . 

While we have demonstrated aspects of the Vainshtein mechanism at the level of 
the 7r-Lagrangian, for simplicity, it is worth noting that it was originally discussed using 
the full theory |399|, 1567] . In |399j it was emphasised how one must choose the correct 
expansion parameter at a given scale. Standard perturbation theory corresponds to 
performing an expansion in r s /r, but this is not a good expansion parameter in the limit 
r c — > oo, as then the next-to-leading order terms become singular. At short distances, 
the claim is that one should expand in powers of r/ry around the Schwarzschild solution 
[567], so the result can be smoothly patched onto the standard perturbative solution in 
some neighbourhood of the Vainshtein radius |399j . 

It is fair to say that the Vainshtein mechanism has not yet been fully explored as a 
viable concept. While it is clear that there is a breakdown of linear perturbation theory 
at the Vainshtein radius in both massive gravity and DGP, the arguments for a smooth 
transition to GR are at best promising, but hardly conclusive. N-body simulations 
of large-scale structure in DGP gravity, however, support the case for its successful 
implementation [11211 1269] . We will discuss this further in the next section. 

One should also worry about the elephant problen^] An elephant is an extended 
object made up of many point particles each with their own Vainshtein radius. Is the 
Vainshtein radius of the elephant the same as the Vainshtein radius of a point particle 
with the same mass located at the centre of mass of the elephant? While this may not 
be a problem when the size of the source particle distribution is much smaller than the 
Vainshtein radius, it clearly becomes an issue when the particles are well distributed over 
the whole Vainshtein sphere. How does one account for such many particle systems? The 
answer is not known and given the role of non-linearities in the Vainshtein mechanism 
the problem may well be very complicated, especially in the full covariant theory. See 
[672] for a recent discussion. 



54 We thank Nemanja Kaloper for this colourful observation. 




(787) 
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5.5.5. DGP cosmology 

In this section we describe DGP cosmology, from the FLRW background, to linear 
perturbations and non-linear studies. A thorough review of DGP cosmology up to 2006 
can be found in |834j . 



As discussed in Section 5.4.1 Shiromizu, Maeda and Sasaki |1140j found the AD Ein- 
stein equations of a 3-brane world embedded in a 5D bulk with Zi symmetry. Applying 
this formalism to DGP we get 



Gfiv = (167rGr c ) 2 IV - E^, 



where 



and 



n 



jl, V 



1^ 

4 



3T Q/3 f Q/3 - f 2 



Tny Till/ 



The Bianchi identities give 



W U E^ = (167rGr c ) 2 V,n^, 



(788) 
(789) 
(790) 

(791) 



while the matter stress-energy tensor satisfies local energy-momentum conservation: 
V„T^ = 0. 

The cosmological solutions for an FLRW space-time were found by Deffayet |394| by 
considering the embedding of the DGP brane into a Minkowski bulk. This corresponds 
to setting = and plugging an FLRW metric into our field equations (788). Note 



that the global structure of the cosmological solutions has been investigated by Lue [833J , 
while some exact solutions in special cases have been found by Dick 420 . If one were 
to generalise the bulk to include 5D Schwarzschild, one can run into an unusual class of 
singularity on account of the branch cut in Friedmann equation [1142, 559J. Cosmological 
branes in generalised DGP gravity can also mimic a phantom equation of state [1086, 835J. 

Staying with the simplest case of a Minkowski bulk and a tensionless brane, the 00 
component of the AD field equations (with E^ v — 0) gives the Friedmann equation as 



9 n e / „ k 8nG 



(792) 



where e = — 1 for the normal branch, and e = 1 for the self-accelerating branch. Clearly, if 
p = then the normal branch implies Minkowski space, while the self-accelerating branch 
gives de Sitter space with Hubble constant — — . This self-accelerating model has 
been proposed as an alternative to dark energy |398j . but as we have already seen, the 
theory contains a ghost in its perturbative spectrum. A phenomenological extension of 
Eq. ( 792 ) has been conside red b y Dvali and Turner 
term is added to Eq. 

After using Eq. 
huri equation, as 



(7921. 



where for n = and e = 1 the 



( 792 ) the ij component can be re- worked into a modified Raychaud- 



It is instructive to cast the background equations into a form resembling an effective 
dark-energy fluid with density pe and pressure Pe- This gives 



w e = — = - d V„ 9 : 3B a2 ■ (796) 



X = 8nG PE = -Jh 2 + ^, (794) 
r c V a 1 

and 

dH , Ofj 2 _|_ 2« 

F = 8ttGP e = -e— — ; (795) 
so that we can define the equation of state, we, as 

Pe _ g +3# 2 + ?? 

The Bianchi identities, and energy-momentum conservation, ensure that 

d ^+3H(pE + P E )^0. 

As a consequence, during radiation domination we get we ~ — §, during matter domina- 
tion we get % ~ — |, during a possible spatial curvature dominated era we get w^; rj — |, 
and eventually during the self-accelerating era we get we — > — 1. 

Let us now look at the perturbed FLRW universe in DGP. Perturbation theory for 
DGP was originally worked out in detail by Deffayet [395], and has since been devel- 
oped by a number of authors. Although brancworld cosmological perturbation theory is 
discussed in detail in Section |5.4.3[ , we will now present the formalism for DGP gravity 
explicitly, in the interests of self-containment. 

Let us consider scalar perturbations in the conformal Newtonian gauge. The compo- 
nents of are they^j 

E° = -He, (797) 

E\ = -V&e, (798) 
I 

3' 

We find the perturbed Einstein equations are then given by 

2 (A + 3k) - m ($' + WS) = A D 8irGa 2 p5 + B D a 2 p E , (800) 
(&+WB) = A D 8irGa 2 (p + P)0 + B D a 2 O E , (801) 

*" + WZ' + m<$>' + ( 2H' + U 2 + ^A J f - Qa + «.)$ 



E l 3 = -he?; + D* 3 .E B . (799) 



A D ^Ga 2 5P - -2- 



a 2 

(1 + w E )A D {8irGa 2 p5 + a 2 he) - yM£ 



(802) 



55 Note that this is consistent with our general analysis for brane world cosmological pertur- 
bations, provided one identifies fi E = -8-nG 4 p weyl 5 weyl , B = -8ttG 4 (^p we « l ) 8 we y l , S B = 
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$ - * = 



\{X + 3Y)8nG{p + P)E - 3£ B 
3 + r 2 c (X + 3Y) : 



(803) 



where we recall that primes denote differentiation with respect to conformal time, and 



H = — = aH . Note that we have set 



2Xrj 
2Xr?-3 



and Br> = 



2Xr 2 -3 



= An - 1, 



for simplicity. In the limit r c — >■ oo one recovers the familiar perturbation equations of 
General Relativity from the above. 

We can now use the Bianchi identities to find the field equations for /i£ and Qe- 
They are 

Me + - A6 B = 0, (804) 

and 



Q' E + m® E - \he + (1 + we) (he + 3-0i 
3 V a 



A + 3k f 4 



l+w E 



(2 + 3iu e ) $ - > =0 



(805) 



It is clear that the above equations are not closed due to the presence of the bulk 
anisotropic stress, Several authors have applied a variety of approximations to 

solve the linear perturbations in the DGP model. Sawicki and Carroll [1101] assumed 
that the Weyl perturbations are zero. It is clear, however, from Eq. (805) that this is 
not a consistent approximation. 

To fully determine the DGP perturbations one must use the five-dimensional equa- 
tions. For maximally symmetric bD space-times, the bulk scalar mode perturbations can 



be deduced using a single master variable, f2, as we discussed in Section \5A\ Assuming 
that the bulk cosmological constant is zero, the bulk metric is given by Equation (696), 
with 



X a p — dz 2 



J u2 i dH \ 

n(t,z) = l+ e{ JJ + ^>\z\ 



n 2 (t,z)dt 2 , 
r(t,z) = a(l + e^/lP+J\z\). 
Using the Master Equation ( |705[ ), we find 

A + 3k n 

~~b 2 b 3 



d 


' 1 dCl' 


d 


n dfl 


dt 


nb 3 dt 


dz 


b 3 dz 



n = o. 



(806) 
(807) 

(808) 



The energy- momentum tensor of the Weyl fluid on the brane can then be related to fl. 
For the case of a spatially flat universe (k = 0) one finds 



Me 



Qe = 



1 k 4 
"3a 5 



S R = - 



z=0 

1 k 2 fdVL 
3^ (~dt~ 
1 



2=0 

dn 



6a 3 V dt 2 dt 



3 dH dCl 
H~dt~dz 



(809) 
(810) 
(811) 



z=0 
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where k is the 3-momentum on the homogeneous background. Thus, in general one has 



to solve Eq. (808 1 with appropriate initial and boundary conditions, and then use Eq. 
(8111 in the perturbed Einstein equations. In practise, one can however apply various 
approximations. 

Koyama and Maartens |737) assume the small-scale approximation k/a 3> r c ,'H.. 



Under this assumption Eq. (804 1 implies that Qe = 0. In quasi-static situations we also 
have d t £l ~ and jj^jf -C ^, so that the master equation becomes c?^0 — 2j ^d z f2 — 

,2 

-4-2 £1 = 0. Assuming that the solution of this last equation is regular as z — > oo, it can 



then be shown that fl — ftbrO- + eHz) "H when aH/k <C 1. Therefore, using Eq. (811) 



one finds that [j,e = 2rS^ on the brane, in the quasi-static limit. Inserting this into 



Eq. (805) gives J^e, and therefore /is, in terms of the potentials. This in turn allows us 



to eliminate all the Weyl perturbations, to get 



and 

- fc 2 * = 4ttG (\ + pa 2 S M , (813) 

where j3 — 1 + 2eHr c WE- 

Sawicki, Song and Hu |1103j propose a scaling ansatz to close the Bianchi identities, 
and solve the master equation on the self-accelerating branch near the cosmological hori- 
zon. This allows them to move away from the quasi-static regime. The ansatz they use 
is O = A(p)a s G(z/zi lor ), where z^ OT = aH J Q a gsjffgp , and s is an exponent that is ap- 
proximately constant during times when a particular fluid dominates the expansion. The 
master equation then becomes an ordinary differential equation with independent vari- 
able z/zhor, and can be solved iteratively by assuming the boundary conditions G(0) = 1 
and G(l) = 0. They find that for super-horizon modes s — 6 during radiation domi- 
nation, decreasing to s = 4 during matter domination, and finally approaching s = 1 
during A domination. For all sub- horizon modes s — 3, which reproduces the quasi- 
static approximation of Koyama and Maartens |737| . By iteration, one can correct for 
the time-dependence of s, as is necessary during transitions between cosmological eras. 
Song has performed a similar analysis on the normal branch |1160j . 

Cardoso et al. [246] solve the master equation numerically by employing null coordi- 
nates in the {t, z} plane, thus obtaining the linearised DGP solutions without resorting to 
any approximations. They find that the quasi-static approximation is valid to within 5% 
for k > O.OlhMpc" 1 . Seahra and Hu |1123j develop analytic solutions for both branches 
of DGP based on the scaling ansatz, and compare these with numerical solutions in |246j . 
They find that the analytic/scaling solutions are accurate to within a few percent all the 
way to the horizon, and therefore that the use of the scaling ansatz in the observational 
constraints imposed in [4821 1817| is justifiable. 

The FLRW background evolution on the self-accelerating branch has been exten- 
sively tested, using a variety of different cosmological observations. Alcaniz [20] used 
measurements of the angular size of high redshift compact radio sources [577] to place 
constraints on r c . He finds that Ar 2 H i > 0.29 at la, with a best-fit value of r c ~ 0.94ff ( ^~ 1 . 

Jain, Dev and Alcaniz |658[ find that gravitationally lensed QSOs require r c > 1.14-ETq 
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at la. Deffayet et al. |403j use SN-Ia from [1034] , and the angular diameter distance 
to recombination of pre-WMAP CMB data, while assuming a flat universe. Their SN 
analysis gives 4r2 1 g2 = 0.17~}1 'q 2 at la, which translates to r c = 1.21^^11^. In- 
cluding CMB data increases the preferred value of Ojvfj and leads to a best fit model 
with r ~ IAHq 1 . Fairbairn and Goobar g75] use the SNLS SN-Ia data [57] and BAO 
data |467j to show that the self-accelerating model is not compatible with a flat universe 
at the 99% level. For the generalised model of (453] they find that —0.8 < a < 0.3 at la 
(the self-accelerating model corresponds to a = 1). Maartens and Majerotto |846) used 
SN-Ia jT064l I67| . a CMB shift parameter [EST] ), and BAO data [467] to place constraints 
on the self-accelerating FLRW background. They find that the self-accelerating model 
is consistent with these data sets to within 2a, but is worse fit than ACDM. This puts 
tension on the self- accelerating model, but as they point out, it is not necessarily reliable 
to use the BAO data for models other than ACDM, as ACDM is used throughout the 
analysis in [467] , Rydbeck, Fairnbairm and Goobar |1080j repeated the analysis of [467] 
by including SN-Ia data from ESSENCE [893 1H280] , and constraints on the CMB shift 
parameter from WMAP-3 |1173| to disfavour the self-accelerating model at the 1 — 2a 
level, depending on whether a peculiar velocity error on the SN-Ia data is included or 
not. The inclusion of BAO data from |467j further supports this result. 

Moving away from using distance measurements alone, a number of studies have been 
performed on the growth of linear structure using the small-scale quasi-static approxi- 
mation of Koyama and Maartens |737j . Song, Sawicki and Hu [1 161] used the angular 
diameter distance to recombination from WMAP-3 |1173j . SN-Ia data from [1064. 67J, 
and constraints on the Hubble constant to exclude the spatially flat self-accelerating 
model at 3a. By allowing non-zero spatial curvature, however, they show that consis- 
tency with the data can be improved, but is still marginally worse than spatially flat 
ACDM. They then use data coming from BAOs, the linear growth of structure, and 
ISW and galaxy-ISW correlations to show that any self-accelerating models that shares 
the same expansion history as a quintessence-CDM models are strongly disfavoured. As 
these authors argue, one must properly take into account the perturbations on the self- 
accelerating branch in order to make firm conclusion. Xia considered the generalised 
model of Dvali and Turner [453J in conjunction with Union SN-Ia data set [733], CMB 
distance measurements, GRB data |1106j . and a collection of data on the growth of linear 
structure [5JH IT2U31 HUTTl [5551 13321 1877] . Using the quasi-static approximation 

of Koyama- Maartens [737] to calculate the growth function, Xia finds a < 0.686 at 2a. 
This excludes the self-accelerating value of a = 1. 

Fang et al.. 482 perform a thorough test of the self- accelerating model using the 
PPF approach of [6271 [630 . Using CMB and large-scale structure data these authors 
find that the spatially flat model is a 5.3er poorer fit than ACDM, while that open model 
is 4.8cr worse. In the latter case non-zero spatial curvature improves the fits for distance 
measurements, but worsens those involving the growth of linear structure. One may 
speculate that changes to the initial power spectrum may be able to improve the this 
situation, but the required reduction in large-scale power also produces unacceptable 
reductions in power in the large-scale CMB polarisation spectrum. Changes of this type 
cannot therefore save the model. In a follow-up study Lombriser et al. [817] constrained 
both branches of the DGP model by employing the PPF description of the CMB and 
large-scale structure. These authors find that either brane tension or A is required for 
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these models to fit the data well, but that both cases the best fitting models are practically 
indistinguishable from ACDM. They further hnd that the cross-over scale is H r c > 3 if 
spatial curvature is included, and Hqt c > 3.5 in the spatially flat case. 

The growth index on small scales, 7, has been calculated by Wei using a Taylor series 
expansion in the fractional density parameters SIdgp an d Q K [1263] . Ferreira and Skordis 
have also calculated this quantity, within a more general frame-work of modified gravity 



models, to find 7 
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0{Q 3 DGP ) [BUT] . This analytic result 



is in excellent agreement with numerical studied] 

Although the DGP model is strongly constrained by the observations we have already 
discussed, it is still instructive to consider the substantial work that has been performed 
on constructing and studying non-linear structure formation scenarios in these models 
[EMI 11122 IM1IIII1IIIIIJISS3 Lue, Scoccimarro and Starkman 

|836) have considered spherical perturbations on sub-horizon scales, and derived the grav- 
itational force law in a collapsing top-hat model embedded in an expanding background. 
They find that the non-linear CDM density contrast evolves as 
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Q,m°~m- Their model uncovers a transition point 
at eo ~ 1, below which gravity behaves as in GR (in accordance with the Vainshtcin 



mechanism). This gives a Vainshtein radius of = 



16GMr' 
9j3 2 



1/3 



, where M is the mass 



of the spherically symmetric object. This result was subsequently re-derived by Koyama 
and Silva |741j without the restrictive assumptions of 836]. It can be shown that the 



solutions to Eq. (814 1 are a factor of two larger than the corresponding solutions in 



theories that obey Birkhoff's law, and that have similar expansion histories. 



The form of G e ff in Eq. (815) is due to an additional degree of freedom in DGP: The 



brane-bending mode, as uncovered in |839[ I963j . To expand upon this we may rewrite 
the RHS of Eq. (814 1 using AttG c s&Pm = A^>. The potential can then be written in 
terms of a Poisson potential, \&p, and a brane-bending mode, ip, as ^ = \tp. These 

two new potentials, "fp and ip, satisfy A^p = 8ttGSpm, and Aip = 87r(G c ff — G)5pM- 
Ignoring non-local contributions, the brane-bending mode can then be shown to obey 
the following equation in the decoupling limit, and in the sub-horizon and quasi-static 
regime: 
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(816) 



This equation is closely related to the Field Equation (775), following from the tt- 
Lagrangian discussed in Section [5. 5. 4| 



56 The terms j^VIdgp an d t^^DGP in the Wei [1263] result are incorrect, and appear to have come 
from erroneously dropping terms proportional to Qdgp when going from Eq. (21) to Eq. (22) of that 
paper. 
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Schmidt, Hu and Lima [1113] used the spherical collapse model in DGP to study 
the halo mass function, bias and the non-linear matter power spectrum. They find that 
top-hat spherical collapse in DGP requires a new, more general method for defining the 
virial radius that does not rely on energy conservation. To obtain the comoving number 
density of halos per logarithmic interval in the virial mass, and the linear bias, they use 
the Sheth-Tormen method |1137j , while they use the Navarro- Frenk- White [946] form for 
halo profiles. In this way they find that the spherical collapse model agrees well with the 
halo mass function and bias obtained from N-body simulations, for both the normal and 
the self-accelerating branch. For the non-linear power spectrum, the spherical collapse 
model in the self-accelerating branch also matches the simulation results very well. This 
is not true, however, for the normal branch, although even in this case the spherical 
collapse model predictions are better than those obtained from HALOFIT |1156j . 

N-body simulations of DGP have been conducted by three independent groups: 
Schmidt [TTTJl [TTTT] , Chan and Scoccimarro 269 , and Khoury and Wyman [550] • The 
general result, common to all of these studies, is that the brane-bending mode on the 
self-accelerating branch provides a repulsive force that greatly suppresses the growth of 
structure, while the opposite effect occurs on the normal branch. All three simulations 
also display the Vainshtein effect. 

The N-body simulations conducted by Schmidt were on both the self-accelerating |1112j , 



and the normal branch [llllj . Rather than assuming Eq. (815 1, Schmidt use a relaxation 



solver on Eq. (8161 to show that the Vainshtein effect is recovered without making any 
assumptions about symmetry. Specifically, he finds that the Vainshtein effect is weak- 
ened for non-spherically symmetric situations, and in general sets in at smaller scales 
than is found in [690]. Like |690j . however, he also finds that the HALOFIT model 
does not correctly describe the non-linear DGP matter power spectrum. Schmidt then 
proceeds to calculate the halo mass function, and shows that the abundance of massive 
halos in self-accelerating DGP is much smaller than in CDM models. This last result 
puts strong constraints on the self-accelerating model from cluster abundance measure- 
ments, independent from of the CMB and large-scale structure constraints discussed 
above [482, 817J). For the normal branch, structure is enhanced, and the abundance of 
massive halos is larger than a CDM model [llllj . In this case, the halo profiles were 
also obtained, and departures from the predictions of GR were seen outside of the halo 
virial radius. Finally, Schmidt calculates the bispectrum in both the normal and self- 
accelerating branches. The self-accelerating (normal) branch bispectrum is found to be 
enhanced (suppressed) for equilateral configurations, but not for squeezed configurations. 
This is in agreement with |269j , and illustrates the diminishing strength of the Vainshtein 
effect for squashed matter configurations. 

Scoccimarro |1121j derives the linear and non-linear equations for the growth of struc- 
ture in DGP without using the Mukohyama formalism. This results in a set of equations 
that includes non-local terms. For example, in the quasi-static limit it is found that 



A 



($ + *) = 8irGa 2 p6, (817) 



rather than A($ + \&) = 8nGa p5, as implied by adding the quasi-static expressions 



given in Eqs. (812 1 and (813). Scoccimarro finds that in the linearised quasi-static 
limit the bulk behaviour decouples from the brane behaviour, and thus that the non- 
local operators can be safely ignored. This ensures the validity of the Koyama-Maartens 
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result 737 on small scales. On larger scales, however, the non-local terms become more 
important. On very scales the linear approximation for the brane-bending mode breaks 
down, and Scoccimarro finds a non-local and non-linear equation for the potential, and 
the density perturbation, 5. Chan and Scoccimarro [269 perform N-body simulations by 
accounting for the non-local operators y/—A and l/y— A. These operators contribute to 
the equation for the brane bending mode, <p, and the potential, ^> |1121j . They uncover 
the Vainshtein mechanism through a broad transition around k ~ 2/iMpc -1 for z = 1, 
and k ~ 1/iMpc -1 for z = 0. They also compute the non-linear matter power spectrum 
and bi-spectrum, the CDM mass function, and the halo bias. The results of all this are 
in broad agreement with those of Schmidt [11121 11111) . 

The simulations of Khoury and Wyman |690j were improved upon in 1290], where 



Eq. (816) was solved. It was found that in DGP, and higher dimensional cascading 
gravity models, peculiar velocities are enhanced by 24 — 34% compared to CDM |780j . 
This corresponds to an enhancement by four orders of magnitude in the probability of 
the occurrence of high velocity merging system such as the "bullet cluster" . 

Scoccimarro |1121j and Koyama, Taruya and Hiramatsu [743] have also developed 
two independent techniques based on higher-order perturbation theory in order to find 
the non-linear power spectrum. Scoccimarro |1121| shows that the non-linearities coming 
from the brane-bending mode can be described by a time and space dependent gravi- 
tational constant. He then goes on to develop a re-summation scheme to calculate the 
non-linear power spectrum and the bi-spectrum. Koyama, Taruya and Hiramatsu [743] 
develop a general perturbation theory method that can be applied in the quasi non-linear 
regime of any theory that has an additional scalar degree of freedom (such as Brans-Dicke, 
f(R) and DGP). For the case of DGP, they find that their perturbative method recovers 
the extreme non-linearity of the Vainshtein mechanism. Their technique has compared 
with the HALOFIT mapping [IToS] . and the non-linear PPF fit of Hu and Sawicki [BlO] , 
It is found that HALOFIT over-predicts power on small-scales, in agreement with the 
findings of |690l 111121 II 111 j , while the non-linear PPF fit works well within the regime 
of validity of the perturbation theory. 

5.6. Higher Co-Dimension Braneworlds 

An important characteristic of any braneworld model is the brane's co- dimension. 
This is given by the difference between the dimension of the bulk and the dimension of 
the brane. Up to this point we have been concentrating on co-dimension one models. 
These are by far the most well developed, mainly because they are much more tractable. 
In this section we will discuss braneworld models with co-dimension n > 2. In this case 
the gravitational description typically becomes much more complicated. 

We have already seen higher co-dimension branes in Section |5.2.1| when discussing 
the ADD model |62| in six or more dimensions. This is a particularly simple example 
of a higher co-dimension braneworld set-up, corresponding to a tension-less brane in a 
Minkowski bulk. In general, of course, one must consider more complicated scenarios, 
including the full gravitational back-reaction of brane sources, both in the bulk and on 
the brane. Indeed, attempts to solve for a higher co-dimension delta function source 
in Einstein gravity will generically result in a singular bulk geometry unless the energy- 
momentum of the source is given by pure tension [316] . This is problematic if we insist on 
using that infinitely thin defect as a model of our 4-dimensional Universe. However, in a 
realistic set-up the defect will not really be a delta function source (it will have some finite 

228 



thickness, and the would-be singularity will be automatically resolved). Alternatively, 
one could consider a genuine delta function source and avoid issues with the singularity 
by including higher-order operators in the bulk gravity theory [4491 11741 12831 1282j . 

Despite being extremely difficult to study, higher co-dimension braneworld models 
have been discussed in the literature (see, for example, [256 1 l3l6 l IT2501 15551 [TBI 14481 [449 | 

1022, 348, 349 ). Much of the interest in this field lies in the fact that these models offer 
new ways to think about the cosmological constant problem [2561 13161 112501 [TBI 450, 518J. 



For example, in Section 5.6.2 we will briefly review the importance of higher co-dimension 
branes in the degravitation scenario |61j . 

Co-dimension two branes are particularly interesting as there is reason to believe 
that within them one may be able to realise self-tuning of the vacuum energy. This 
is because a maximally symmetric co-dimension two brane generically behaves like a 
cosmic string in the bulk by forming a conical singularitjj^J Changes in tension then 
alter the deficit angle, as opposed to the geometry, of the defect |316) . Braneworld 
models with two extra dimensions shaped like a rugby ball have been developed with 
this in mind [256J . The curvature of the brane is now completely determined by the bulk 
cosmological constant and the magnetic flux, independent of the brane tension. One 
might then infer that phase transitions on the brane that alter the brane tension do not 
alter the curvature. This is not the case, however, as such a transition can affect the 
brane curvature via the backdoor, by altering magnetic flux, so there is no self-tuning 
1250, 965 . The situation can sometimes (but not always |214[ 1215"] ) be improved by 
including super-symmetry and dilaton dynamics in the bulk to protect the relationship 
between the bulk cosmological constant and the flux [T3J 12161 12171 12141 1215) . A low scale 
of bulk super- symmetry breaking, set by the size of the rugby ball, can then account for 
a small amount of dark energy |216) . Other notable contributions to the literature on 
rugby ball compactifications with fluxes and/or scalars include the following: A detailed 
discussion of the brane-bulk matching conditions |213lH2T| . an analysis of the low energy 
effective theory |708j . and a study of cosmological evolution on the brane [1021] . 

The rugby ball described above represents a compact extra dimension, with poten- 
tially troublesome/interesting light moduli fields 18, 19]. In any event, it is worth pursu- 
ing alternative braneworld models with non-compact extra dimensions if they are able to 
reproduce 4D gravity at some scale. Co-dimension two branes in Gauss-Bonnet gravity 
will be discussed in this context in Section [5. 7. 4| On the other hand, one might expect 
that an infinite bulk should be compatible with some form of guasi-localisation of grav- 
ity by considering generalisations of DGP gravity |454j in any number of dimensions. 
Such generalisations are sometimes referred to as Brane Induced Gravity (BIG) models 
[4481 14491 14501 EH [4361 HHU [3861 EZH [3381 EH [6691 ESH] . In the simplest scenario we can 
consider BIG models on a single 3-brane of co-dimension n, described by the following 
action [448] : 

S = ^— / d 4+n xV=jK + / d 4 x^R. (818) 



bulk * J bulk 

351 



Using higher derivative operators to resolve the singularity in the bulk propagatoiPj we 



57 There are exceptions that give rise to curvature singularities as opposed to conical, even for maximal 
symmetry on the brane |1208| . 

58 Note that this theory becomes higher-dimensional at large distances, so the finer details of how the 
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find that for n = 2, 3 the Green's function in momentum space along the brane takes the 
form |449| 

G(p,0) = i+s-, (819) 



where 



r>(p,o) 

0) = / (2tt)« p 2 + g 2 + (p 2 + <Z 2 ) 2 /M 2 „ ' (820) 

and where M„„ < represents the regularisation scale. Taking M uv ~ 10 -3 eV, 
this theory reproduces normal 4D gravity at intermediate scales M~* <C r -C r C) but 
is modified at sub-millimetre scales and at very large distances. It has therefore been 
dubbed seesaw gravity in |449j . The large distance cross-over scale can be computed 



from Equation (819) 



&y/\a.{M uv r c ) for n = 2 



M. 

M*i nzz , Q ( 821 ) 

forn = 3. 

Unfortunately, these constructions suffer from the presence of ghosts in the spectrum of 
fluctuations [436J . 

5.6.1. Cascading gravity 

More sophisticated BIG proposals make the transition from 4 + n dimensional gravity 
to 4 dimensional gravity via a series of intermediate steps, (4 + n)D — > (4 + n — 1)D — > 
...^5D^AD (SHU ESS E3SI ESS EMI US] . This could potentially help with the 
ghost problem. These are sometimes referred to as cascading gravity models as the ef- 
fective description tumbles down a cascade of extra dimensions as we come in from large 
distances, and increase the resolution of our description. Let us now turn our attention 
to this class of gravity models. 

The Kaloper-Kiley model 

The first braneworld model to explore a cascade from 6D — > 5D — > AD gravity 
was developed by Kaloper and Kiley 669 , 668 . They considered a landscape of models 
describing BIG on a 3-brane in six dimensions. The singularities are resolved by modelling 
the 3-brane as a cylindrical 4-brane with compact radius tq. The brane has tension 175, 
and some induced curvature weighted by a 5D Planck scale, M5. The effective AD 
tension and Planck scale at distances r 3> r are then given by o~^ s = 2irr a o- 5 and 
Af| ff = 27rr M|, respectively. Note that an axion flux is used to cancel the vacuum 
pressure in the compact direction q 2 — 2a '5^. 

The resulting vacua resemble a cone in the bulk, with the conical singularity cut away 



(see Figure 12). The metric is given by 

ds% = rj^dx^dx" + dp 2 + fipfdcf 2 , (822) 



theory behaves there will be sensitive to how we resolve the singularity in the bulk 448 . 
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Figure 12: Taken from 669 . An illustration of the 2D bulk geometry formed around the resolved brane 
in vacuum. 




where 

f(p) 



The brane tension does not affect the geometry along the brane's non-compact di- 
rections. Instead, it controls the deficit angle measured at infinity, according to b — 
2c5ro/-Mf , where M§ is the fundamental scale of gravity in the bulk. When the tension 
lies below a critical value, er cr i t = Mf/2ro, the deficit angle is less than 27r, and the bulk 
space is infinite. For sub-critical branes, 05 < <7 C rit, the theory generically resembles 
the seesaw gravity theory. We have 4D (Brans-Dicke) gravity at intermediate scales 

r < r < r c , and a crossover to 6D gravity at a scale r c ~ (i-fc)M 2 \/ m (2(l ~ b)r c /ro). 
Note that the scalar fluctuations naively indicate the possible presence of a ghost, but this 
conclusion cannot be trusted since the perturbative theory breaks down due to strong 
coupling [669) . This is consistent with the conclusions drawn in |435| for thick brane 
regularisations of seesaw gravity. 

Things are much more interesting in the near critical limit, for which b = 1 — e, where 
< e < 1. In this case the brane lies inside a very deep throat, such that the angular 
dimension is effectively compactified up to distances of the order rg/e (see Figure 13). 
It turns out that perturbation theory is under much better control in this limit, and 
that there are no ghosts. The improved behaviour is due to the fact that we make a 
series of transitions from AD — > hD — > 6D gravity. Indeed, the theory looks like four 
dimensional Brans-Dicke gravity at scales tq < r < r c , becoming five dimensional beyond 
r c ~ M| fi /Al^rQ. The transition to six dimensions only occurs at very large distances, 
r > ro/e, at which point the cylinder decompactifies. 

Note that this model does not represent a solution to the cosmological constant prob- 
lem. Instead it recasts it in terms of a fine-tuning of the hierarchy between cross-over 
scales at each transition. 
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Figure 13: Taken from 669 . An illustration of the 2D bulk geometry formed around the resolved brane 
in vacuum, in the near critical limit. 



The cascading DGP model 

A more recent BIG model exploring these ideas was developed by de Rham et ai, 
who coined the phrase cascading gravity 381, 386, 379 . In the simplest model |381j . one 
has a DGP 3-brane within a DGP 4-brane within a six dimensional bulk. This situation 
is described by the following action: 



S 



Mi 



d 6 x^/-g 6 R 6 



Ml 



bulk 



Mf 



4-br 



(823) 



3-branc 



where (g§)AB, {g^)ab and {gn)^ are the metrics in the bulk, on the 4-brane and on the 
3-brane, respectively. The corresponding Ricci scalars are given by Rq,R5 and R4. The 
model contains two important mass scales corresponding to the following ratios: 



m 5 



Mf 
Mf 



and 



m 6 



Ml 
Ml 



(824) 



The claim is that the intermediate DGP 4-brane can help to resolve the singularity in the 
bulk propagator at the location of the 3-brane .381 . Let us now consider fluctuations 
due to a conserved source, T^ u , about a bulk and branes that are all Minkowski. The 
field can then be decomposed in terms of a scalar, tt, and a transverse and trace-free 
tensor, hJJ , such that 

= lijjj + irr]^ + gauge terms. (825) 
Working in momentum space one finds that the Fourier transformed fields are given by 
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(828) 



The amplitude between two conserved sources on the brane, and T', now takes the 
surprisingly simple form 
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Ml 
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(829) 



The last term represents the contribution from the scalar amplitude, whereas the remain- 
der is the contribution from the massive spin-2 field. The coefficient of scalar amplitude 
changes sign as we flow from the IR to the UV, 



p 2 +g{p 2 ) 

p 2 - 2g(p 2 ) 



— ^ asn^O 
1 asp^oo, 



(830) 



indicating the presence of a ghost in the UV theory, consistent with the results of 
To eliminate this ghost we must add additional operators to the 3-brane action that 
modify the scalar propagator in the UVp"| It turns out that this can be achieved simply 
by adding tension, A, to the 3-brane |381l 1388] . As the 3-brane is co-dimension 2, its 
vacuum solution is unaffected by the change in tension (it is still Minkowski). In contrast, 
the 3-brane tension does alter the profile of the fields in the bulk by creating a deficit 
angle. This means that kinetic terms describing fluctuations on the new vacuum now 
receive corrections from the non-linear bulk interactions. The result is that the scalar 
propagator on the 3-brane is modified. One might expect that such effects will necessarily 
be suppressed by the five and six dimensional Planck scales, but this is not necessarily so 
because of strong coupling of the scalar in the five dimensional boundary effective field 



(831) 



theory [38111388] . 

In any event, at higher energies, p ^> ttiq, the modified amplitude goes like 
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For large enough tension, A > 



, the scalar amplitude always has the correct sign, 



so there is no ghost. However, the tension cannot be arbitrarily large since we require 



59 Note that the various formulae for propagators and amplitudes in 381, 386 differ. One must take 
m 5,6 — ► 5 m 5,6 m g° m g from 13811 to 13861 . The difference is because only one half of the 6D bulk, and 
one half of the 4-brane are considered in I386| . This is a perfectly legitimate truncation by Z2 symmetry 
across each of the brancs. Here, however, we will adopt the conventions of 381 when expressing our 
formulae. 

60 ft has also been argued that thickening of the brane can help to eliminate the ghost in this model 
[386] 
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that the bulk deficit angle is less than 27r. This places an upper bound A < 2-kMq = 
2"Km^m^M.\. Provided m 6 < 777,5 we therefore have a window, 



< A < 27rm5 mg M 4 , 



(832) 



for which the theory is ghost-free in the UV [3811 1388j . The condition m 6 < 777,5 has 
added significance in that it permits a transition from AD to 5D at energies p ~ 7715 . In 
particular, we have AD scalar-tensor gravity in the far UV, for p > 7715 > mg, 
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and 5D scalar-tensor gravity for 7715 > p > me, 
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In the far infra-red, p < m^, we recover 6D gravity, 
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(834) 



(835) 



Thus we have a cascade from 61? — > 5D — > AD gravity as we move from large to short 
distances. It is interesting to note that the existence of this cascade seems to be closely 
related to the absence of ghosts. By taking M4 ~ M p i, M 5 ~ 10 MeV and M 6 ~ meV, 
the cascade occurs at the current horizon scale, as m 6 ~ m 5 ~ Hq. Note that a cascade 
from seven dimensions has also been studied in some detail in [387] . 

For tensions in the range allowed by Eq. ( 832 ) the AD theory at short distances is a 



Brans-Dicke theory that is incompatible with solar system constraints. The authors of 
381 suggest that some sort of Vainshtein mechanism could account for the recovery of 
GR at short scales. Indeed, the role of non-linearities in cascading DGP has not yet been 
properly investigated, and one may be concerned that there are issues surrounding the 
validity of the linearised results (as was found in the sub-critical regime of the Kaloper- 



Kiley model [669] ). 

Another aspect of cascading DGP that requires further study is cosmology (see 
|900l [T2"] for some preliminary work). In [12\ a five dimensional proxy theory is used 
that is expected to capture many of the salient features of the original model. This 
proxy model corresponds to a 51? scalar-tensor theory that, in the limit where the scalar 
and tensor degrees of freedom decouple, agrees with the decoupling limit of the 5D 
boundary effective field theory of cascading DGP presented in 381 . Of course, it is by 
no means unique in this respect. 



Induced gravity on intersecting braneworlds 



A third BIG model that is very closely related to cascading DGP was developed by 
Corradini, Koyama and Tasinato |338| [333- Here a six dimensional bulk contains two 
intersecting DGP 4-branes, each with induced curvature. At the intersection we place a 
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DGP 3-brane, also with some induced curvature. The cosmological evolution is derived 
using a formalism based on mirage cosmology [685 . The resulting Friedmann equation 
on the 3-brane is then given by |339j 



2MlH 2 



QMik 




4M fi 4 tan" 1 



tana 




(836) 



where a is the angle between the two 4-branes, and fc 2 is a constant that encodes in- 
formation about the warped geometry in the bulk (and can be derived in a non-trivial 
way from the parameters of the theory). The model claims to admit self- accelerating 
and self-tuning vacua. For fluctuations on a Minkowski 3-brane one may also expect a 
cascade from 61? — > 5D — > AD gravity as we move from large to short distances, for suit- 
ably chosen scales. A thorough perturbative analysis has yet to be done that confirms 
this expectation, or the possible existence of ghosts and strong coupling. 

5.6.2. Degravitation 

Higher co-dimension braneworld models and cascading gravity are expected to play 
an important role in realising the degravitation scenario |61j . The idea of degravitation 
is best understood by rephrasing the cosmological constant problem. Instead of asking 
"why is the vacuum energy so small?', we ask "why does the vacuum energy hardly 
gravitate at all? 1 . In other words, we accept what our particle physics models are saying 
and take the vacuum energy to be up at the (TeV) 4 scale or beyond. We then try to 
develop a gravity theory that prevents this large vacuum energy from generating a large 
amount of curvature. 

A phenomcnological description of degravitation is given by |61j 



g- 1 {L 2 n)G ia , = ^r iafi 



(837) 



where Newton's constant, G, has been promoted to a differential operator, Q(L 2 \3), 
depending on a length scale, L, and the covariant d'Alembertian operator, □. The idea is 
that this operator behaves like a high pass filter characterised by the scale L. Sources with 
short characteristic wavelengths, I <C L, pass through the filter and gravitate normally. 
Sources with long characteristic wavelengths, I ^ L, such as the cosmological constant, 
are filtered out and hardly gravitate at all. These considerations amount to imposing 
the following limits: Q -> G for L 2 (-U) -> oo and Q -> for L 2 (-U) -> 0. 

Since the covariant derivative does not commute with the d'Alembertian, Equation 



( 837 1 suggests that the energy- momentum tensor is not conserved. However, it is impor- 



tant to realise that this equation is only expected to describe the linearised dynamics of 
the helicity-2 mode of the graviton, given by g^ v = r/^ + hp V . The source is conserved 
with respect to the full metric, g^ v = r\^ v + h^ v + . . ., where the ellipsis denotes the 
additional (Stuckelberg) modes that necessarily appear in a fully covariant theory |452j . 
It is clear that the extra modes must play an important role in the filtering process at 
long wavelengths. The transition to normal gravity for short wavelength modes occurs 
precisely because these modes get screened by the Vainshtein mechanism |1241| (see our 
discussion of the Vainshtein mechanism in the context of DGP gravity in Section 5.5.4 1. 
The filter operator is often parametrised as follows: 



g-\L 2 u) = 



i - 
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m 2 (D) 
□ 



G- 



(838) 



where m 2 (D) oc L 2 ( Q_1 )D a , and < a < 1. The upper limit is a necessary, but not 
sufficient, condition for degravitation, whereas the lower limit is required by unitarity 
|452) . Massive gravity (see Section 3.3.31 corresponds to a = 0, whereas DGP gravitjj^ 
(see Section 5.5) corresponds to a = 1/2. It turns out that models with co-dimension 
n > 2 are expected to correspond to a = 386 . 



Using Equation (837) as its starting point, degravitation can be demonstrated in a 
cosmological context by the ratio of scalar curvature to A scaling as t -1 / 2 for a broad 
class of filters, where t is the proper time of comoving observers [452 . However, the 
corresponding cosmological solution arising from a higher co-dimension brane scenario 
has yet to be found explicitly. 

Working at the level of the phenomenological equations, N-body simulations for de- 
gravitation scenarios have been studied by Khoury and Wyman [690J . Their simulation 
is based on a modified Poisson equation, given by 



1 fkL\ 



2 a 



* = AirGp. 



(839) 



Thus, the emergence of the Vainshtcin mechanism is encoded in the simulations via Eq. 



(815), rather than being recovered from the simulation itself. They find that the matter 



power spectrum determined from the simulation agrees well with the HALOFIT |1156j 
formula for k <~ 0.2/iMpc -1 . However, for smaller scales the HALOFIT formula over- 
predicts power because HALOFIT does not capture the Vainshtein effect. Nevertheless, 
they find a way to recalibrate the HALOFIT parameters such that it agrees with their 
simulation on all scales. 



5. 7. Einstein Gauss-Bonnet Gravity 



In Section 2.4.1 we outlined how Lovelock's theorem 1831, 832 constrains the class 



of theories that can be constructed from the metric tensor alone: In four dimensions, 
the most general rank-2 tensor that can be derived from a variational principle and is 
(i) symmetric, (ii) divergence free, and (iii) built out of the metric and its first two 
derivatives only is given by a linear combination of the metric and the Einstein tensor, 
i.e. 

1 (840) 



c(4D) 



Such a term arises from the variation of the Einstein-Hilbert action in the presence of a 
cosmological constant, 



1 



'9 



d A x^ 



-g(a + aiR) 



(841) 



In four dimensions the Einstein-Hilbert action therefore gives the most general field 
equations with the desired properties. In more than four dimensions, however, this result 



61 The full DGP theory does not exhibit degravitation, but this is not necessarily the unique theory 
with a = 1/2. 
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no longer holds. For D = 5 or 6 dimensions the most general rank-2 tensor satisfying 
the same three conditions is given by |831] (see also [7671 F1313J) 

g(5D/6D) _ _^ ao7ab + aiG ab + a 2 H ab , (842) 

where (in D dimensions) j ab is the metric, Q ab = Ti-ab — hT&iab is the Einstein tensor, 
and we have introduced the Lovelock tensor [831] 

H ab = 2KK ab - m aa n c b - 4K acbd K cd + 2K acde K b cde - ^Gj ab , (843) 

where G = TZ a bcdT^ abcd ~ ^R a bT^ ab + and 7l a bcd is the Riemann tensor constructed 
from 7 ft. The Lovelock tensor is obtained by variation of the Gauss- Bonnet action, 

1 5 f ,d. 



n «» = ^6^J d XV ^°- (844) 

These results can generalised still further as the dimensionality of space-time is increased. 
To see how, note that d D Xy/~'j7l and dPxyJ—^G correspond to the Euler classes of order 
one and order two [936J , and are topological in D < 2 and D < 4 respectively^] To 
generalise Lovelock's theorem to even higher dimensions we must then add the higher 
order Euler classes to the corresponding action, and compute the metric variation. The 
Euler class at order k depends on kth powers of curvature. In D dimensions we must 
therefore include Euler classes up to order \(D — l)/2], where the square brackets denote 
the integer part. The reader is referred to [270J , and references therein, for further details. 

5.7.1. Action, equations of motion, and vacua 

In five or six dimensions General Relativity is just a special case of a broader class of 
theories commonly referred to as Einstein-Gauss-Bonnet (EGB) gravity. These theories 
are described by the action 

S = J cPx7=7 (n - 2 A + a<G) + J d D xC m { lab ,i>), (845) 

where C m is the Lagrangian density of the higher dimensional matter fields that are 
minimally coupled to the metric. The corresponding field equations are given by 

Gab + Ajab + aH ab = 8irG D T ab , (846) 

where T ab — — ^= j d D xC m {^ ab , iji) is the energy-momentum tensor of the matter 
fields. As we will see shortly, although A acts like a bare cosmological constant, it differs 
from the effective cosmological constant, A e gf, seen by the geometry. 

The Gauss-Bonnet corrections are weighted by the parameter a. This has dimensions 
of [length] 2 and is often associated with the slope parameter, a', in heterotic string the- 
ory. To see why, consider the effective theory describing the dynamics of the heterotic 



62 Onc can check explicitly that the Einstein tensor is identically zero in two dimensions, and that the 
Lovelock tensor is identically zero in four dimensions. 
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string on the 10 dimensional target space. Working at tree level in the string coupling 
and performing a perturbative expansion in the inverse string tension, fij, 1 ~ 2na' , 
we see that the leading order term gives the standard Einstein-Hilbert action, and the 
next to leading order term, after some field redefinitions, gives the Gauss-Bonnet ac- 
tion |566[ [886 . In fact, one may have expected such a result based on the fact that 
string theory is known to be ghost free. As noted by Zwiebach [1316 , second-order 
corrections to the Einstein-Hilbert action will necessarily give rise to ghosts unless they 
appear in the Gauss-Bonnet combination. Note that if we do identify EGB with a stringy 
generalisation of General Relativity, we should restrict attention to a > for consistency. 

Maximally symmetric vacua 

We now consider the maximally symmetric vacuum solutions, satisfying TZ a b c d = 
("facjbd — %d%c), where A c g is the effective cosmological constant seen by the 



(D-l)(D-2) 

curvature. There are two possible values for A e g, given by 



2A 

Acs 



A e ± ff = A cs i± A /i-J^_], (847) 



where 

_ 1 (D-l)(D-2) 
AcS -~4a(D-3)(D-4Y {M8) 

For these vacua to be well defined the bare cosmological constant must satisfy the bound 
A/ Acs < 1/2- It is easy to check that A^/Aqs > 1 > A~ ff /Acs, with equality when 
A = Acs/2- This is known as the Chern-Simons limit (at least in odd dimensions) 
|1297[ [344 , and corresponds to the case where the two roots coincide. Note that only 
the lower root has a smooth limit, A^ — > A, as a — > 0, and as such is often referred 
to as the "Einstein" branch. In contrast, the upper root, Ai, is not smooth as a — > 0, 
and represents a distinct new feature of EGB gravity that is completely absent in higher 
dimensional General Relativity. For this reason, this branch is often referred to as the 
"stringy", or "Gauss-Bonnet", branch. 

We now consider metric perturbations about these vacua, 7 a f, = 7 a (, + #7 a i,. The 
linearised field equations then take the remarkably simply form 

SGab + AcffSjab = 8nG cS T ab: (849) 

where bQ a b is the linearised Einstein tensor. Thus, we have perturbative Einstein gravity 
with an effective Newton's constant given by 

1 Acs 

Assuming that the bare Newton's constant is positive, Go > 0, it follows that pertur- 
bative gravity on the Einstein branch (A e s/Acs < 1) is essentially well behaved, as 
G c s > 0. In contrast, on the Gauss-Bonnet branch (A c g/Acs > 1), we have G e g < 
indicating the presence of a perturbative ghost |279j . We refer the reader to the closing 
paragraphs of Section [5. 5. 3| for a discussion of the pathologies associated with ghosts. 
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The above conclusions regarding stability are robust provided we can trust our ef- 
fective perturbative description. Here we expect this description to be valid at energies 
up to a cut-off, -Ecut-off ~ l/CGeff) 1 ^ 0-2 . As we approach the Chern-Simons limit 
(A — > Acs/2), where the two branches coincide, it follows that the cut-off for the effec- 
tive description should have the limit -©cut-off — > 0. This indicates strong coupling, and a 
breakdown of perturbation theory. To analyse the stability of either branch close to this 
limit one must study non-perturbative phenomena such as instanton transitions. This 
reveals that transitions between branches are unsuppressed in the near Chern-Simons 
regime, and that there is very strong mixing between the two (almost degenerate) vacua. 
We conclude that neither of them can accurately describe the true quantum vacuum state 
in this regime, as both will quickly become littered with bubbles of the other vacuum 

Spherically symmetric solutions 

Static spherically symmetric solutions to the vacuum field equations were first dis- 
covered by Boulware and Deser [176J, 

dr 2 

ds 2 = -V{r)dt 2 + ^— + r 2 dn D _ 2 , (851) 
V(r) 

where (KId-2 is the metric on a unit (D — 2)-sphere. We have two branches for the 
potential, V(r), given by [17611279] 



where is the volume of the unit (D — 2)-sphere. M is an integration constant 

that one can identify with the mass of a spherically symmetric source (possibly point- 
like). As above, the lower root corresponds to the Einstein branch, and has a smooth 
limit as a — > 0. The upper root corresponds to the Gauss-Bonnet branch, and does not 
have a smooth limit. Of course, the two roots are degenerate in the Chern-Simons case 
(A = Acs/2). 

We shall stay away from the Chern-Simons limit in the remainder of this section. Note 
that a generalised form of Birkhoff 's theorem now holds that guarantees that Equation 



(851) represents the most general spherically symmetric solution, even without the as- 
sumption of staticity |272j . Let us consider the properties of this solution for a source 
of positive mass, M > 0. On the Einstein branch the singularity at r — is shielded 
by an event horizon, and there are no obvious pathologies. This is not the case on the 
Gauss-Bonnet branch, where we have a naked time-like singularity. 
At asymptotically large radii the metric functions take the form 

2A ± fF r 2 M , s 

v *»- ( J -i)V a) +1± A c ,w--- - (853) 

The asymptotic solution here can be seen to resemble the generalised Schwarzschild- 
Tangherlini solution in D dimensional GR |1202j . with cosmological constant A^ s , and 
mass ±M. In particular, on the Gauss-Bonnet branch, it appears as if our solution has 
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negative gravitational energy even for a positive energy source. This conclusion, however, 
is incorrect. A proper computation of the gravitational energy taking into account all 



of the Gauss-Bonnet corrections reveals the mass of the solution in Eq. (|851|) is +M on 
both branches 



, 1010 . For an in-depth discussion of this, and related stability 
issues, the reader is referred to |279j . For an excellent review of Lovelock gravities, and 
their black hole solutions, see |270j . 

5.7.2. Kaluza-Klein reduction of EGB gravity 

In Section [5 . 1 . 1 1 we discussed compactification of higher dimensional GR on a circle. 
The same operation can now be performed with the EGB action, given in Eq. ( 845 ) . We 
start in five dimensions with coordinates {x^ , z), where the z direction is compact. We 
can then dimensionally reduce down to four dimensions using the following ansatz 



7p 



and 



= e 2 \ 



(854) 



where the metric, g^ u , gauge field, and the dilaton, <fr, are all independent of z. 
Integrating out the compact dimension we arrive at the following effective theory |915j : 



S s & = n I d 4 xy/-g£ cS , 
16nG 5 J 



where 



(855) 



£eff 



R 



; 20 p2 



ae 



3<t> 



2A + a(R fiVa pR> wa P 
3 



e 20 ( F ^ F , a p apF l^ 



4R^R^ + R 2 ) 
1 



2\2 



- 2(V fi F a p)(V a F^) - 2(V»F afl )(V u F av ) - \2{F m V a <f){F^V p<j>) + 6F 2 (V0) 5 

+4(F a0 \7^)WF afj - 4(F a ^V M 0)V Q! F^ - 12(F aP W^)W ^ 

+ ae* [8e _ *V M V v e* - 2e 2<s 'F iia F v a \ + Zae 2 * [2F Q/S 'F^V M V ' a e* + F^e*] , 

(856) 

and F 2 = F^F^, with F^ = V ^A„ - V^A^. Note that for A = and a = wc 
recover the results of Section |5.1.1[ provided we perform a conformal transformation to 
the Einstein frame and canonically normalise the dilaton. 



One property of the Lagrangian in Eq. ( 856 ) is that it gives rise to field equations that 



are at most second order in derivatives. This is, of course, inherited from the underlying 
theory, and enables us to consider an interesting class of scalar-tensor theories by freezing 



the gauge field such that A^ — 0. We refer the reader to Section 3.1 and [35], for further 
discussions on four-dimensional theories of this type. 



5.7.3. Co-dimension one branes in EGB gravity 

Interest in EGB gravity has really taken off in recent years, largely due to its appli- 
cation to braneworlds. We discussed the braneworld paradigm in some detail in Section 
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|5.2| Schematically, the physics of 3-branes in five dimensional EGB gravity is more or 
less the same as in General Relativity, as we will now discuss. 

Generic action and equations of motion 

As in the GR case discussed in Section |5.3.4| we split our five dimensional bulk into 
a series of domains separated by a series of 3-branes, so that the action is given by 

Ml 

^(?l + «G) + 4uik 



S= / a?x^i 

'bulk 



brancs 1 



J d A xV~g {-A [Ml (K + 2a(J - 2G^K^))] + £ branc } , (857) 

J branc 



where Af 5 = {I/SttG^} 1 ^ is the bulk Planck scale. In comparison with Equation (615) we 
have a Gauss-Bonnet correction in the bulk and the corresponding Myers boundary term 
|931j on each brane. The latter depends on the extrinsic curvature, K^, the induced 
Einstein tensor G^ = — ^Rg^, and the trace J = J M „, where [363J 

J M „ = i {2KK m K™ + K^K^K^ - 2K ^K^ ] K >„ - K 2 K^) . (858) 

The bulk field equations are then given by 

n ab - ^R lab + aU ab = ^Ta6 Ulk , (859) 

with the boundary conditions at the brane given by the "DGW" junction conditions 
[H3 ESS] (see also [II]), 

A {Ml [K^ - K 9lut + 2a(3J^ - J 9fl „ - 2P liavf) K^)\ } = -T^ ane . (860) 

Here we introduce the double dual of the Riemann tensor, defined as [902] 

P^e = -R"\ + 2#V^] - 2RV l« s %] - m \J v f>] ■ ( 861 ) 

531. 



In what follows we will assume that the bulk geometry is onl}] |sourced by a cosmological 
constant T^ h ulk = Mf A7 ab , which we take to be negative, as in the Randall- Sundrum 
model, A = — 6k 2 . We will consider a single brane with tension a, and some induced 
curvature, M 2 nd R. Its energy momentum tensor is then given by 

T h ™ n ° = -°9^ - M 2 nd G^ + % v , (862) 

where 7^„ denotes the contribution from any additional matter excitations on the brane. 
For simplicity we will impose Z2 symmetry across the brane (see [10121 11013j for a dis- 
cussion of asymmetric configurations). 



For generalisations with bulk scalar fields and bulk Maxwell fields see 1271 1 and |805| . respectively. 
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Weak gravity on a Minkowski brant 



Let us seek Randall-Sundrum-like vacua, corresponding to a Minkowski brane in an 
AdS bulk, 

als 2 = % b dx a dx b = dz 2 + e-^/^rt^dx^dx". (863) 

The effective AdS curvature in the bulk is then determined by the bulk equations of 
motion ( 859 ) . We find 

1 

W ' 




(864) 



where the lower root corresponds to the Einstein branch and the upper root to the GB 
branch. Meanwhile, the vacuum DGW junction conditions at z — impose the following 
constraint on the brane tension 



.Mf ( , 4a 



Since we are interested in weak gravity on the brane, we consider small fluctuations in 
the metric and the brane position. As we saw previously, the linearised equations of 
motion are identical to those found in perturbative General Relativity, 



SQ ab - w S lab = 0. (866) 

'off 

The linearised junction conditions also take a remarkably simple form, 

2(MffS(K? - K5£) = M 2 c JGt - Tf, (867) 
where we identify the effective five dimensional Planck scale, 

(Mf)*=Ml(l-p) (868) 



/ 

'err- 



and the rcnormaliscd induced curvature scale, 



Ml^Ml d + ^l. (869) 

«eff 

The effective Plan ck scale is consistent with the effective Newton's constant introduced 
in Equation (850), (M£ s ) = (l/STrGeff) 1 / 3 . It follows that our weak gravity description 
is identical to the corresponding description in General Relativity, provided we make use 
of the effective cosmological constant and Planck scale in the bulk, and renormalise the 
induced curvature on the brane. For further details the reader is referred to [270] . and 
references therein (see also |413| ). 

The characteristic behaviour of weak gravity on the brane is as follows [270 : At 
large distances we recover four dimensional General Relativity, with an effective four 
dimensional Planck scale 



\/M 2 nd + Mll cS (l + Aa/l 2 cS ) = ^Ml a + {Mffl cS . (870) 
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This holds provided l/l e g 7^ 0. For l/l e g = the large distance behaviour is five dimen- 
sional owing to the absence of a normalisable zero mode in the Minkowski bulk. 

At short distances we recover four dimensional Brans-Dicke gravity, with a differ- 
ent effective Planck scakQ M^ v = y / M^ nd + 8aAf|// cff = M ron , and a Brans-Dicke 
parameter given by 

2,„ 4. S - 3 ( Ml d /l cS + M!(l+4a/P cS ) \ 3 / {Mpf) 2 \ 

+ 4 I Ml(l Aa,ll s ) ) ~ 4 \{Mffl cS ) ' m) 

This holds provided M ron 7^ 0. If A/ rcn = then the short distance behaviour is five 
dimensional since there is no effective induced curvature giving rise to quasi-localisation 
near the brane. 

These results are consistent with the behaviour that would be expected in linearised 
theory on a brane in five dimensional General Relativity, with bulk cosmological con- 
stant, A e ff = 6/l 2 s , bulk Planck scale, M| ff , and brane induced curvature scale, M ren . 
It is interesting to note that the Brans-Dicke parameter gets large close to the Chern- 
Simons limit as Mf- S — > 0. However, as we saw earlier, quantum fluctuations in the bulk 
become strongly coupled at ~ Mf , so this prediction may be unreliable. 



Brane cosmology in EGB gravity 



The methods used in Section [574] to derive the cosmological evolution of co-dimension 
one branes apply equally well to branes in EGB gravity, provided we use the DGW 
junction conditions give in Eq. (860 1. Recall that brane cosmology can be studied using 



either the brane-based formalism or the bulk-based formalism. The brane-based formalism 
relies on the covariant formulation of the effective Einstein equation on the brane. This 
has been worked out for EGB gravity and applied to cosmological branes [859] (see also 
|709j ). In the bulk based formalism, the generalised form of Birkhoff's theorem [272] 
ensures that the bulk geometry around a FLRW brane is given by 



ds 2 = -V{r)dr 2 



dr 2 
W) 



r 2 qijdx l dx\ 



where V(r) takes the form 



V±(r) 




(872) 



(873) 



and qij(x) is the metric of a 3-space of constant curvature, k = 0,±1. Note that this 
generalises the black hole solution given in Eq. ( 852 1 to different "horizon" topologies 
|233j . Let us once again assume 1i symmetry across the brane for simplicity. As in 
Section 5.4.2 we treat the brane as an embedding r = r(t), and r — a(t) in the bulk 



geometry, identifying t with the proper time of comoving observers on the brane, and 
a(t) with the scale factor. After imposing the DGW boundary conditions given in Eq. 



There is a typo in the corresponding expression for M^ v in |270| . 

243 



( [860] ), we find that the Hubble parameter, H = a/a, obeys [270] (see also [5551 [2721 [5571 
851,7TB]) 



4a 

Y 



v + 4 

v a 2 



V{a) 



H 2 



V(a) 



Pbt 



6Mf 



3Mf nd 



-(874) 



6M 5 3 

This can be recast as a cubic equation in H 2 + j%, and solved analytically to give a 
modified Friedmann equation. The most commonly studied scenario has M; n( j = 0, with 
the bulk taken to lie on the Einstein branch. We then find |363) 



H 1 



K 
7/2 



8a 



where 




a(cr + p) 2 , (v + p) [a 
2M| Mf V 2 



2/3 



(875) 



(876) 



This cosmology can give rise to rapid inflation, just as in Randall-Sundrum cosmology 
|806j . However, the Gauss-Bonnet corrections do introduce some new features at the 
level of cosmological perturbations. For example, in Randall-Sundrum cosmology the 
consistency relation between the tensor-to-scalar ratio, r, and the tensor spectral index, 
tit, agrees with the standard GR result, r — —8ut- This relationship is broken by the 
GB corrections as the amplitude for the tensor perturbations are no longer monotonically 
increasing with scale [437] . We also find that observational constraints of GB brane 
inflation are typically softened relative to Randall-Sundrum cosmology, such that certain 
"steep" potentials are no longer ruled out |1224| . 

5.7.4- Co-dimension two branes in EGB gravity 



In Section 5.6 we mentioned that higher-order operators could be added to the 
bulk gravity to regularise the singularities that appear when we have an infinitely thin 
braneworld of co-dimension > 2. One of the ways to do this is to add Gauss-Bonnet 
corrections in the bulk [1741 1283 , 282 . The simplest and most well studied scenario is to 
consider co-dimension-2 branes in six dimensional EGB gravity without any additional 
sources in the bulk (see, for example, [T7J H77J ME ME\ IMD [MB GHE1 EH] ) • This gives 



S 



bulk 



-(K + aG) 



d^xC 



branc- 



brane 



16ttG 6 

If we assume axial symmetry in the bulk we get 

ds 2 = 'f AB dx A dx B = dr 2 + L 2 (x,r)dd 2 + g^x, r)dx^dx v , 
and it turns out that the Einstein tensor on the brane is given by [1741 1277] 

G[IV 



(877) 



(878) 



(879) 
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whore 

W„ v = K*K vX - KK^ + ^g^(K 2 - K XrT K x °), (880) 

and = \d r g^ v . The parameter (3 is the deficit angle on the brane, and is often 
assumed to be constant, although this need not be case |277j . A field dependent deficit 
angle, (3 = f3(x), will lead to two important effects: Transfer of energy between bulk and 
brane, and an effective four dimensional Newton's "constant" that can vary as G c g = 
8-kol(i-!3(x)) 1^77| . These can clearly be constrained by observation, but it would be very 
interesting to study the role varying /3 could play in attempts to self-tune the vacuum 
curvature in co-dimension 2 models. 

The function f((3) depends on the mathematical technique used to derive the bound- 
ary conditions at the distributional source |2771 16761 1675j . The boundary conditions 
derived in |174j lead to the condition W Mr/ | brane = 0, suggesting that Einstein gravity 
should be recovered on the brane at all scales, even for an infinitely large bulk. However, 
it is now understood that these conclusions rely on the assumption d r K^ v = at the 
brane, which is too constraining. For constant (3 the Friedmann equation on the brane 
is given by [277] 

tt2 k 87rGcff 1 c2 , > 

tl + -j = ^ Pbrano - T7. h -g g, (881) 

where c 2 is an integration constant. However, as we already emphasised, in general j3 can 
vary, and in this case one cannot find a closed system of equations on the brane |277] . 
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6. Parameterised Post-Priedmannian Approaches and Observational Constraints 

As expounded in this review, there now exists a vast range of candidate theories of 
gravity that modify Einstein's theory of General Relativity in one way or another. It is 
also seems clear that many more such theories are likely to be proposed in the future. 
If these theories are to be of any value in understanding and resolving the problems 
associated with the Dark Universe then they must be confronted with cosmological data. 
This is, in principle, straightforward but time-consuming. It involves working out the 
perturbation equations for each and every theory, incorporating them into the Einstcin- 
Boltzmann solvers and N-body codes, and calculating a list of observables. 

This situation is analogous to the experimental study of General Relativity in the 
early 1970s, during what has become known as the 'Golden Age' of General Relativity. 
There, one had a plethora of alternative theories of gravity that needed to be confronted 
with constraints from Solar System measurements. The Parameterised Post-Newtonian 
(PPN) method was invented in this case as an intermediate step between theory and 
experiment. It involves a set of generic parameters that can be easily constrained by 
experiments |1207j . Using the PPN method one can then take any given theory, calculate 
the PPN parameters it predicts, and compare them with observational constraints. This 
process is outlined in Section [53J 

Over the last few years, the idea of creating such an intermediate step when con- 
sidering cosmological constraints has starting taking hold. It has been dubbed by some 
the 'Parameterised Post-Friedmannian approach', and it attempts to encompass, at the 
linear level, the behaviour of a wide array of alternative theories of gravity. We will now 
outline the basic idea behind this approach. 

The bulk of the cosmological data that can be used to constrain modifications of grav- 
ity can be interpreted in terms of perturbations about a Friedmann-Lemaitre- Robertson- 
Walker universe. Throughout this review we have presented how the evolution of cosmo- 
logical perturbations is modified in these theories, relative to their behaviour in General 
Relativity. One can now ask oneself if there is a general way of modifying the equa- 
tions of cosmological perturbations such that it will encompass all the theories we have 
previously discussed. 

The simplest approach, that has been in vogue for the past few years, is to modify 
two of the four Einstein field equations as follows: 

-2/c 2 $ = 8Tr^Ga 2 Y,Px[Sx + 3{1 + w X )U0 x ] (882) 
x 

= £$, (883) 

where two new functions have been introduced: The effective Newton constant, G c g = 
/iG, and the gravitational slip, £. One can interpret G c g as the inclusion of a form of grav- 
itational screening, reducing (or enhancing) the local gravitational force on cosmological 
scales. The gravitational slip phenomenologically parametrises the shear that seems to 
arise frequently in scenarios of modified gravity. This parametrisation is incredibly useful 
for quantifying deviations from General Relativity, and a number of authors have used 
it in their analyses of cosmological data QUI [5371 EH EH HQ HEH IMS 023 H502] . 

It is instructive to see in what circumstances such a parametrisation might arise, and 
to do this we will develop a consistent formalism in what follows. 
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6.1. The Formalism 

In order to generalise the perturbed Einstein equations we follow the approach and 
notation used in |1149|, 111514 I501|, [77] . Here we split the field equations of the theory 
in question into a set of evolution equations for the metric, evolution equations for the 
additional gravitational fields (if any are present), and a set of constraint equations. The 
evolution equations for the metric, and the constraint equations, can then be written 
schematically as 

8G™ d = 5G M „ - 5U M „ = 8irG [ST^ + 5T®] , (884) 

where 8G^ V is the perturbed Einstein tensor, 5Un V is the contribution of any other terms 
that involve perturbations of the metric, 5T^ U is the perturbed energy-momentum tensor 
of matter fields in the space-time, and 5T^ V is the contribution of any terms that involve 
perturbations to the additional gravitational fields. 

Let us now be more specific. We can define the following new variables: J7a = 
-a 2 SU° , VJJq = -a 2 5U° l , U P = 5U\ and D^U? = a 2 {5U i j - iSU^j), as well as the 
new gauge invariant 

fs-^+W*), (885) 



2.6 



where the Bardeen potentials $ and arc defined in Eqs. (85) and (86) of Section 
Dropping the hats (i.e. working in the conformal Newtonian gauge), we can write the 
two constraint equations coming from Gqo and Goi as 



-2/c 2 ($ + 3?4r) = 8irGa 2 J2pxSx + U A (886) 

x 

2kT = 8irGa 2 ^2( Px + Px)O x + U B , (887) 
x 

and the two evolution equations coming from the trace and traceless parts of Gy as 

/ 1.2 \ 1.2 1 

kT' + 2kHT + \U' -U 2 - —j^ + —<$> = 4vrGa 2 ^<5Px + -C/ P 



= 87TGa 2 ^2{p x +Px)^x + U^(889) 



x 



We can then combine Eqs. ( |886[ ) and ( |887 ) to find a modified Poisson equation 



2fc 2 $ = 87rGa 2 ^ p x [S x + 3W(1 + w x )0 x ] + U A + 3UU&- (890) 



x 



Assuming that the theory in question has a t m ost A -tim e derivatives in its field 



equations, and bearing in mind from Eqs. (85), (86) and (885) that $ and T have one 



time derivative when expressed in an arbitrary gauge, the components of the tensor U a p 
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can be written as 



JV-2 



Up = 



Ut 



k 2 ~ n A,M n ^ + E n fM 

n=Q 
N-2 

n=0 
N-l 

5^ k 2 - n [C„$ (n) + J n f < n - 

n=0 
V-l 



n=0 



(891) 
(892) 
(893) 
(894) 



where = 4^1, and similarly for f . The coefficients A n -J n depend on time and 
scale through the scale factor, a, and wavenumber, k. For the sake of brevity we will 
refrain from explicitly stating these dependences. 

Now, although we have defined U above in the Newtonian gauge, the individual terms 
appearing in the expressions above are all gauge invariant. This fact, however, imposes 
further constraints because the gauge-invariant variable T contains second derivatives of 
the scale factor, when expressed in an arbitrary gauge. Hence, to avoid higher derivatives 
of the background appearing in the field equations, we have to set 



E 



N-2 



= F, 



N-2 



= 1 



N-l 



= Jn-i = 0. 



(895) 



We assume that the evolution equations for the matter fields remain unchanged, and that 
these equations are supplemented by additional evolution equations for the extra gravi- 
tational fields. Finally, the field equations are closed by imposing the Bianchi identities. 
This imposes one of two possible options: (i) V a T^ = V a U a p — 0, which imposes a 
series of constraints such that the theory remains consistent, or (ii) V Q (T^ + U a p) — 
0, which is the more general situation. For a detailed discussion of these issues see 
[11391 ITT5T1 177] . 

6.1.1. Evolution of perturbations on super-horizon scales 

In principle it may seem that one should explore the long wavelength behaviour of 
cosmological perturbations on a case by case basis. It turns out, however, that the 
infinite wavelength mode can be studied simply by considering the evolution of the back- 
ground equations. This observation is due to Bertschinger |148j . and proceeds as follows. 
Consider a background with scale factor a(r, k) where k is the spatial curvature of a 
hyper-surface of constant r: 



ds 1 



-dr 2 + d X 2 



1 



sinh^v^xjdfi 



(896) 



where sinh K (cc) equals sin(a;) for k > 0, equals x for k — 0, or equals sinh(x) for k < 0. 
We can now perturb this space-time as k — > k(1 + S K ) , and compensate by a change the 
coordinates r — > t + a and \ x(l — \$k)- Here S K is a constant, while a = a(r). Note 
that in this case the scale factor a(r, k) is perturbed as a(r + a, k(1 + S K )). In words, the 
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scale factor will come out as the solution of some generalised set of Friedmann equations, 
and will depend on the spatial curvature, k. 

We can now write this new geometry in the form of a perturbed FLRW metric, with 
background curvature k: 



ds 2 



(l + 2*)dr 2 + (l-2$) 



d X 2 



1 



smhl(^/Kx)dn 



(897) 



where 



*(r) 
$(r) 



d In a 

«1 — ° K + ° 

1 d In a 

2 ~ <9hiK 



"Ha, 



Ha. 



(899) 



One can now eliminate a to find a generic evolution equation that relates 5 K with $ and 
without specifying any particular theory of gravity: 

= $ - * 



1 d 



a 2 dr V H 



J__d 
2a dr 



ainH 



dbxK 



(900) 



where entropy perturbations have been neglected^ The constant 5 K that remains in the 
equation above has a direct physical interpretation: It is twice the comoving curvature 
perturbation. Note that we have also assumed that shear perturbations are negligible on 
large scales, that local energy-momentum us conserved, and that spatial gradients can 
be discarded. By choosing H(a, k), and a relation between $ and <3/, one now has the 
evolution equation completely defined. This is a powerful statement, as it means it is 
possible to determine the evolution of large-scale perturbations without delving into the 
details of the theory. 

However, in order to complete the system here one still needs to specify a relation 
between $ and "J. As we will describe below, it has become standard practise to assume 
the simplest form of the PPF parametrisation, that we described above, on super-horizon 
scales. Within this approach comparisons have been made between choices of fi and £, 
and the outcomes of numerical solutions for specific theories. These comparisons show 
reasonable agreement, but, as yet, there is no compelling argument for applying the 
simplest PPF parametrisation on large scales. In other words, there is no guarantee that 
a simple relation of the form W = can encompass all possible theories of gravity. In 



fact, it would appear that Eq. (900) docs not allow such an interpretation. 



6.1.2. The simplified PPF approach, and its extensions 

Let us now revisit the simplest, and by far the most popular, version of the Param- 
eteris ed Po st Friedmannian approach [T48l [2371 f630j l627]l4T]. Here one considers only 
Eqs. (889) and (890). The further assumption that perturbations have no anisotropic 



stress (i.e. = 0) then allows one to reorganise them in the form of Eqs. (882) and 
(883). These equations can then be rewritten in terms of the U a p tensor by choosing 



U A + 3HU e = 2k 2 



Including entropy perturbations is straightforward, and for this the reader is referred to |148| . 
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The coefficients A i: B i} Ci and Di must now be chosen to satisfy this condition. 

It is understandable why this approach is popular, as it has a number of benefits. 
Firstly, it is applicable in the quasi-static regime the arises when k — > oo (i.e. on small 
scales relative to the cosmological horizon). Furthermore, the perturbation equations 
now form a closed system. This means that if, for example, we restrict ourselves to a 



dust-filled universe then Eqs. (|882|) and (8831, together with 



S' M = -fc 2 M + 3$' (901) 
Q' M = -H6m + *, (902) 

form a complete system of differential equations that can be straightforwardly solved. 

It is instructive at this point to specify the various versions of the simplified PPF 
formalism that are currently in use: 

• Caldwell, Cooray and Melchiorri |237[ 1360) introduced the systenj^] 

A* = 1 

C = w = tu a 3 , (903) 
which was later extended to |361j 

H = l + /i a 3 . (904) 

• Bertschinger and Zukin (1481 1151] proposed a reduced parametrisation that takes 
into account the conservation of long wavelength curvature perturbations. Only 
one parameter is considerecj^j 

Ibz = 1-0 (905) 

that is further refined as ^bz = 1 + P aS , where j3 and s are constants. These 
authors make the additional assumptions that jbz depends only on time, even 
on sub-horizon scales, and that the Bertschinger long-wavelength construction (see 



Section 6.1.11 can also be extended to sub- horizon scales. The latter of these as- 
sumptions allows them to solve for $, from which \l/ and 8 are then determined. The 
assumption of scale-independence is later relaxed further, such that the following 
parameterisation can be made 

1 + ftfcV 



IBZ 



1 + /3 2 k 2 a s 
G* = /i(l -Q = G 

where and fti are constants P°j 



1 + aik 2 a s 
1 + a2k 2 a s 



° 6 The initial convention was that but the authors changed their convention in 

si 0Af 

subsequent publications. 

67 The symbol used was actually 7, to which we have added a subscript "BZ" to distinguish it from 
other parameters with the same name. A similar approach will be used with the other frame-works 
presented here. 

68 We kept the original symbol G$ here, although under our conventions it would be more accurately 
written as G^, as it plays the role of an effective gravitational constant for a modified Poisson equation 
for *. 
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Amendola, Kunz and Sapone [H] modify the Poisson equation and the slip by 
considering the two functions Qa and t]a' 



Qa = M, 

VA = -C, (906) 

while a third variable, ~Ea = (3.4(1 + 77^4 /2) , is also introduced in order to simplify 
the relevant expression for gravitational lensing: $ + ^ = 2E^$. A comparison of 
this approach is made with DGP and scalar-tensor theories, both on sub-horizon 
scales. 

Zhang et al. [1301] make the parameterisation 

1 

G eff = Ml-C/2). (907) 
These authors also introduce the Eq statistic, which we describe further below. 

Zhao et al. [1302] modify the Poisson equation using Vt. They introduce the two 
parameters 

1 

VZG - 1 _ > ) 

Vzg = /i(l-C). (908) 

as well as a third derived parameter S^g = I^zgO- + Vzg)/^ = £a. Further 
refinements are made by considering two specific models: A case where fizG — 
Hzg(t) and rjzc — '?zg( t ) transit from their GR values in the early universe to 
modified constant values today, and a case where the two functions are pixelised 
in the {r, k} plane by 2 x 2 pixels per function. 

A parametrisation specifically designed for small scales was proposed by Amin, 
Blandford and Wagoner [33], where 

B A (U k ,a) = (l-C)/* = A>(a)+A(a)ft* + /3a(a)«£ + ... 
T A (H k ,a) = /i = 7o(fl)+7i(a)H fe +7 2 (a)H^ + ..., (909) 

while the matter density fluctuation is likewise parametrised by 

6 m (a, k) = S(k)i [60(a) + &i{a)U k + 8 2 {a)Ul] , (910) 

where 8(k)i is specified by initial conditions. In this approach different theories 
correspond to different sets of the functions {ft, 7,, 8i}. The authors find the ap- 
propriate functions for ACDM, scalar-tensor theories, quintessence models, f(R) 
theories, and DGP. They also note, however, that not every theory can be ade- 
quately matched to this expansion (e.g. k-essence). 
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The simplified PPF approach, though useful as a phenomenological tool that can 
be used to constrain fi and £, is not without its problems. For a start, it is not clear 
what theories a specific choice of parameters actually encompasses. In jl 1491 15011 177] 
this problem has been addressed in detail, and it is shown that if one wishes to consider 
theories with second order field equations only then the PPF equations become: 

-2/c 2 $ = 8irnGa 2 Y,Px[6x +3{l + w x )H9 x } (911) 
x 

= (912) 
k 

with the constraint fi = 1 — In other words, if the field equations of the theory are 
second order then the simplified PPF approach is not applicable. 

Let us now consider what the simplified PPF approach does correspond to in the 
field equations. Taking the expansion for U a p, and considering terms up to the lowest 
acceptable order, we have: 

U A = A k 2 <S> + AxW + E k 2 T, 
U e = B k<S> + Bi& + F Q kT, 

Up = c fc 2 $ + c\k<f>' + c 2 $" + i k 2 r + lifer' , 

U s = D a <S>. 

Applying the Bianchi identities one can then determine the above functions in terms of 
C = Dq and g = f — 1. This gives i*o = Eq = Iq = I\ = (i.e. no V terms appearing), 
and 

kB = 2k 2 ^ % ^° k2 , A = 2g - m k B , C = 2C + 3 (^B' + 2H k B Q 
Bi=2k 2 m , ^ _ k2 , A l = -m k B l , kC 1 =3(B[+2HB 1 +kB Q ) 



Since Ai + 3HkBi = Eq = Fq = 0, the form of the Poisson Equation (890) is retained. 
However, since A\, B\ and Ci do not vanish, the field equations contain third time 
derivatives in arbitrary gauge. This is due to the presence of and corresponds to 
a higher-order gravitational theorjj^J In other words, one to simplify the equations for 
the gravitational slip, but only at the cost of introducing higher-order terms in the other 
evolution equations (to enforce consistency). 

The above way of reconstructing the J7's from the simplified approach is not unique. 
One can also relax the condition V a T^ = V a U a p = 0, but this moves us further away 
from the simplified approach. 

A further problem of the simplified approach is that it is impossible to determine 
[i and C for a specific theory without solving the field equations for a specific choice of 
initial conditions. What we would like to have is a one-to-one correspondence between 
the functions that appear in the parametrised frame-work and those that appear in the 
theories themselves, without having to solve the field equations every time (just as in the 
PPN formalism, discussed in Section [2j. This can be achieved by constructing the field 
equations as in Section |6.1| 



69 Sincc the form of the Poisson equation is retained, however, both $ and \1/ remain non-dynamical. 
Hence, the higher derivatives do not introduce additional propagating degrees of freedom. 
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6.1.3. The Hu-Sawicki frame-work 

Hu and Sawicki introduced a frame- work |630] . later generalised by Hu |627j . that 
depends on a function of time and space, gHs( T ,k), two functions of time, /f(r) and 
/g(t), and a constant, cr- Their frame- work goes beyond the simplified PPF frame- 
works described above, and tends to them in either the super-horizon or sub-horizon 
limits of the late Universal The field equations in this case are written as 



$ + * = - 



87rGa 2 



J2 Px [Sx + m{\ + w x )6 x ] - 8nGa 2 J2(Px + Px)^x + 2T HSl 



x 



x 



= 8nGa 2 J2(px + Px)^x + 9hs ($ + *), 



(913) 



where c K = 1 



3 K 

k 2 ■ 



The variable Ths is obtained by solving the differential equation 



cr 

n 



i 



r' ff5 + HT HS + 2*\T hs + -fci «!' - * 



2(1 + Sirs) ^1 

= (2H 5ffs - </rr<j)($ + *) + ^Ga 2 g HS £ [((p x + P x )^x)' + U{ Px + Px)^x] 



x 



8nGa 2 
fc 2 



[.9rrs(l + fc) + fa] £>x + PxWx + ^i^(PE + P b )0t, CM i i 



A" 



87rGa 2 
fc 2 



where #t 



v" r, — is calculated in conformal Newtonian gauge 71 

2_, v Px +Px ° ° 1 

(9131 and Eq. (914) and Px do not include contributions from the dark energy. 
Taking the sub- horizon limit, for which Hk — > 0, we get that Ths — |/g(^ 
the system then reduces to the simplified frame-work with 



In both Eq. 

OIK 

*), and 



P 

c 



1 



ffrrs 



1 + /g ' 
2.9rrs 



1 



Sirs 



(915) 



In the super-horizon limit the system obeys the Bertschinger construction (see Section 
6.1.1), with gns — 9hs( t ) while /{(t) provides the next order correction (beyond the 
Bertschinger construction). We see that the sub- horizon limit depends only on gus and 
fc, while the super- horizon limit depends only on gns and /f. The constant cr controls 
the transition scale between these two limits. 

As in the simplified approaches, it is not clear from the outset what kind of theo- 
ries this frame- work encompasses. One can find fitting functions gas, /g and that 
reproduce the solutions for specific theories, but only after some experimentation. 



70 At late times the matter content of the Universe is effectively described by dust and dark energy, 
for which the dust has no anisotropic stress. 

71 In the original formulation the comoving gauge was used, but we translated the equations here to 
make it more consistent with the other approaches we described. 
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6. 2. Models for [i and Q on Sub-Horizon Scales 

On sub-horizon scales the situation is greatly simplified. In this case one can make a 
quasi-static approximation, and discard all time variations in the perturbed fields. Let 
us now consider this regime in a few different models. 

To start with we can consider f(R) theories. Following [1043] we then have 

-k 2 (<£ + *) = ^^BKsm, 

JR 

*"* = - o f 2f /*(* + *), 

which can be rewritten in the fi~C form given above. Note that there is a scale in these 
equations, the "Compton wavelength" given by 

a 2 JR 

For large wavelengths Q — > 0, and ^ — <f> — > 0. For small wavelengths Q » 1, and 
#-$~-(2/3)/ R ($ + ¥). 

Let us now consider DGP models. In this case the expression for gravitational slip 
can be found to be |737j 



where f3 = 1 + 2eHr c WE (for more details see Section 5.5 and in particular Eqs. (812) 



and (|813j). 

Finally, let us consider scalar-tensor theories. For a generalised coupling parameter, 
F ((/)), it can be shown that 41J 



c 



G* 2{F + F 2 ) 
FG cav 2F + 3F'i 

F' 2 
F + F' 2 



The expression above can be very useful in studying the evolution of structure on 
observable scales. They can, for example, be used to correctly reproduce the growth rate 
of structure, and the effects of lensing, on scales of up to hundreds of Mpc. 

6.2.1. The importance of shear 

An important issue arises if one wishes to distinguish between the effects of dark 
energy and the effects of modified gravity. In |759j the authors constructed an explicit 
example of how anisotropic stress could mimic the effect of gravitational slip. To see 



this let us consider Eq. (889) with Us — DqQ. There is clearly a degeneracy between 



the two terms on the left hand side. In particular, if we focus on DGP then we have 
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that £ = 0, but J7 E = fffffl However, we can of course consider an alternative 
model where gravity is unmodified, but the dark energy has anisotropic stress given by 
(1 +we)^e = ~2p" gf^f ^ Af ' This would be an odd form of dark energy, but it could 
also explain observations that may otherwise have been attributed to gravitational slip. 
Distinguishing between such models is, of course, made easier when considering a wide 
variety of different scales. 

6. 2. 2. The growth function 

The primary effect of modified gravity will be on the growth of structure. The time 
evolution of the density field can be a sensitive probe of not only the expansion rate 
of the Universe but also its matter content. In a flat, matter dominated universe we 
have that 5m, the density contrast of matter, evolves as Sm oc a. We can parametrise 
deviations from this behaviour in terms of the growth function, /, given by 

f=Jk~> (917) 
nd M 



or by the introduction of a parameter 7 through 1D3TJ 1813 

/ = fi], . (918) 

Note that 7 and / are not parameters in the usual sense, but are derived quantities (indi- 
rect observables, to some extent). It is, however, sometimes convenient to parameterise 
some results or processes in terms of these quantities. 

For standard growth in the presence of a cosmological constant one has 7 ~ 6/11 to 
a very good approximation, although this can change with VLm- For the case of General 
Relativity with a dark energy component with equation of state P — wp (where w is 
constant) we have 

7 = 70 + 7l r! s + O(ft|) (919) 
where Qe = 1 — fiw, and where 70 and 71 are given by 

3(1 - w) 



To 



7o 

7i = s~ 
2w 



5 — 6w 

6w 2 - 7w + 2 2-3u; 



-~/o 



5 - 12w 

In the case of a cosmological constant (i.e. with w = —1) these expressions reduce to 

6 15 _ 

7= n + 2057^ + --" 

which gives the first order correction to the expression 7 sa 6/11, given above. 



(920) 



72 Actually, as discussed in Section |5.5| DGP has shear but the quasi-static limit imposes the condition 
fiE = 2k 2 Y,E between energy density associated with the perturbation of the Weyl tensor and the shear. 
This in turn generates the term above. 
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A natural question to ask is how the growth parameter, 7, depends on modifications 
of gravity. There have been a number of attempts at finding analytic expressions that 
relate 7 to parameters in the underlying theory. This has focused on specific theories, as 
well as the extended PPF approach. We now discuss some of the results. 

An expression for the growth parameter in the quasi-static limit of f(R) theories was 
found in |58j . Here one can define a time dependent mass scale: 

M 2 (a) ' 



where fan is the second functional derivative of / with respect to the Ricci scalar, R, 
evaluated at the General Relativistic value of R. A non-local expression for 7 can then 
be found of the form 

6 n A k 2 _ u/2 r d 3 / 2 dd 



7 = — — — a 



11 2ft M J fc 2 + 5 2 M 2 (a)' 

One can see that for k -C aM(a) this expression for 7 remains the same as in General 
Relativity, but that once k crosses the mass threshold modifications start to kick-in. 
In DGP models the correct expression for 7 is given bjj^ |501j 

117 93 

16 + 5632°-" 4096°- + < 921 > 

This result is in excellent agreement with numerical studies (to within 2% or better for 
n E < 0.8, and to 5% for Q E < 0.9). 

An attempt at finding approximate analytic expressions for 7 was presented in |501j . 
In this report we will restrict ourselves to limiting cases where £ ~ Ci^-B- If we then 
assume that we can Taylor expand 7 as in Eq. ( |919[ ) then we find the coefficients 

71 = ^ [-3Yi - (2 - 3w)Y 2 + 4F 2 ] , (923) 



where we have defined 
1 - w + Ci 



Y 2 = 



w(5 — 6w) ' 

(1 - w)(l5w 2 -4w-l) + Ci(9w 2 + 2w - 2) Ci 



2w 2 (12w - 5) (5 - 6w) 2w 2 (l2w - 5) (5 - Gw) ' 



A comparison between the approximation described above and the exact numerical value 
of 7 is shown in Figure |14| More general expression for 7 that include the impact of /it, 
varying w, higher-order terms in fig, as well as scale dependent correction in Rk are 
presented in |501j . 



73 Incorrect values for 7 in DGP models have been presented in 816 41 1263 . 
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Figure 14: The growth parameter, 7, for a selection of gravitational slip parameters of the form £ = Ci^Ei 
as a function of Qm- The dashed curves are the numerical results for £1 = 0, 0.2, 0.4 and 0.6 in ascending 
order, and the corresponding analytic approximations arc plotted as solid lines. 

6.2.3. Current constraints on the PPF parameters 

At the time of writing this report, there have only been a few attempts at constraining 
the PPF parameters using existing cosmological data sets [123[ I361| 11302] . Let us now 
consider each of these in turn. 

In |123) the authors find constraints on Q = \i and R = 1 — £ using the Union 
set of Supernovae from the Supernovae Cosmology Project (with an additional sample), 
the joint 2dF and DR7 SDSS survey estimate of the Baryon Acoustic Oscillations, the 
WMAP 7-year CMB Temperature and Polarisation data, the matter power spectrum 
from the SDSS DR7 release, auto and cross-correlation functions from 2MASS and the 
SDSS LRG catalogue with ISW and the COSMOS weak lensing survey. 

A scale and time dependent form of the PPF parameters is proposed: 



Setting k c — > 00 and s = the authors then find time independent constraints on the 
parameters: 0.97 < Q < 1.01 and 0.99 < R < 1.02 (both at the 95% confidence level). 
Their analysis shows that primary constraints come from the WMAP7 data, and that 
current weak lensing and cross correlations between galaxies and ISW have a minimal 
effect. Allowing time dependence greatly relaxes the constraints on the parameters: 
1.04 < Q < 2.66 and -0.22 < R < 1.44 (both at the 95% confidence level). If a 
transition scale of k c — 0.01 Mpc -1 is considered, parameters are then constrained to be 
0.47 < Q < 3.49, -0.80 < R < 2.52, 0.97 < Qoo < 2.65, and -0.43 < R x < 1.76. 

The analysis of Daniel et al j361J extends the above analysis to include an incarnation 
of the CFHTLS lensing survey, which has substantially larger coverage than that of the 
COSMOS survey. An earlier release of WMAP data is used, and cross correlations 



Q(k, a) = 1 + [Q e fc/fec + Q«,(l - e fe/fcc ) - l]a s , 
R(k, a) = 1 + [R e k/k " +R SX> (1- e k ' k ") - l]a s . 
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between large scale structure and ISW are not included. The authors focus on scale- 
independent parameters, ( = ui = u> a 3 and /i = ^qo 3 , and find —1.4 < u> < 2.8 and 
—0.67 < fio < 2.0. The authors also consider a more elaborate time evolution of both Q 
and fi by dividing it up into bins in the following redshift ranges: (a) [0, 1] , (b) [1, 2] and 
(c) [2, 9]. Setting G) = they then find that /i is constrained to be: —0.074 < /iQ a < 0.08, 
—0.058 < fiob < 0.14, —0.023 < /ioc < 0.22. Setting fi = 1 they find that Cj is constrained 
to be: -0.074 < w 0a < 0.08, -0.058 < Q ob < 0.14, -0.023 < w 0c < 0.22. 

In |1302j the authors consider two different parameterisations. In the first they con- 
sider a scale independent but time evolving parametrisation for t]zg and ^zg, given by 



Eq. (9081 as 



1 //,. / : - : 

^zg(z) = — 7. — I 1 + tanh 



Az 

z — z. 



( I • t i 1 1 1 1 1 A 



Mo, 



These authors use essentially the same data sets as in |123j , albeit with an earlier release 
of the WMAP data (5 year) and without BAO constraints. They find that if z s = 1 
then the constraints are 0.65 < < 1-9 and —0.41 < t/q < 2.18, while if z s — 1 the 
constraints are 0.68 < < 1-11 and 0.7 < i] < 1-9. 

In a second parametrisation the authors bin the parameters in both time and scale. 
They choose not to bin in 77, but rather in Y>zg = /izG ( 1 + T?ZG ) , This is the combination 
that enters into the calculation of both the ISW effect and weak lensing through their 
dependence on $ + 'J. They consider the following bins: (a) k £ [0,0.1], z € [1,2], (b) 
k e [0,0.1], z e [0,1], (c) k e [0.1,0.2], z € [1,1], and (d) k € [0.1,0.2], z e [1,2]. The 
resulting constraints are then as follows: In bin (a) 0.96 < Si < 1.04, 0.64 < /^i < 1.42, 
in bin (b) 0.93 < S 2 < 1-07, 0.65 < ^ 2 < 1-34, in bin (c) 0.58 < E 3 < 1-02, 0.24 < // 3 < 
2.24, in bin (d) < S 4 < 2.23, 0.05 < ^4 < 2.46. 

6.2.4- Constraining the growth rate 

Over the past few years there have been attempts to target the growth rate directly 
using redshift space distortions. This method involves measuring either the redshift 
space correlation function, £, s (r\\ , rj_) (where the r|| and r± correspond to parallel or 
perpendicular directions to the line of sight), or the redshift space power spectrum, 
Pg{k). The power spectrum can be related to the real space power spectrum through an 
extension of what is known as the 'Kaiser formula': 



P s (k) = P(k)[l + 2^P A + fJ. 4 nG 



k ■ h fa* 
H 2 {z) 



(924) 



where fi = t, b is the bias factor, G(x) encodes the non- linear effect due to velocity 
dispersion, and fj, is the cosine of the k vector with the line of sight. Note that P s (k) 
becomes anisotropic (as does the correlation function) and it is through this anisotropy 
that one can measure /3 and hence /. 

Until recently, measurements of (3 were seen as constraints on Qm- The reason for 
this is that in AC DM we have /? ~ il^ 11 (z)/6, and hence measurements of fi at different 
redshifts can be used to reconstruct the history of £Im- In [585] it is found that the wide 
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part of the VIMOS-VLT Deep Survey (VVDS) can be used to obtain j3 = 0.70 ± 0.26 
at z ~ 0.8. This result is then combined with the constraint from the 2dFGRS, /3 = 
0.49 ±0.09 at z ~ 0.15, and the 2dF-SDSS LRG and QSO (2SLAQ) constraint (shown in 
Figure 15 ). The emphasis in [585j was on finding deviations from growth rate in AC DM, 
and although not done systematically, their analysis showed that a number of specific 
models could be ruled out. 
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Figure 15: The growth rate, /, as a function of redshift, z, for a number of different models. The symbols 
and error bars correspond to constraints from VVDS, 2dFGRS and WiggleZ. Taken from 11651 . 

The analysis of |585j have led to a number of upcoming redshift surveys focusing, 
in part, on redshift space distortions. In particular, WiggleZ, FMOS, VIPERS, GAMA 
and BOSS, many of whom were primarily targeting large scale structure and the Baryon 
Acoustic Oscillations, will now all deliver constraints on f3 in the near to medium future. 

In [1159 it was argued that measurements of the growth of structure through con- 
straints on the large scale peculiar velocity dispersion, a% (the large-scale redshift space 
distortion), can be combined with weak lensing measurements to break the degeneracies, 
and target /i more accurately. With measurements at different redshifts it should be 
possible to reconstruct the time evolution of /x. 

6.2.5. The Eq diagnostic 

The effect of modified gravity on the gravitational potentials, $ and \P ', can in principle 
be teased out with a careful combination of measurements. In |1301) it was argued that 
this should be possible using a cross-correlation between an estimate of the local velocity 
field from redshift space distortions, and the gravitational potentials inferred from a 
lensing map from background galaxies. More specifically, from a more general form of 
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Eq. (9241, 



P s (k) = P s (k)[P g (k) + 2^P g9 (k) + fi 4 P e (k)]G 



H 2 (z) 



(925) 



it is possible to estimate the galaxy- velocity power spectrum, P g g, in bandpowers P a , 
and the galaxy-galaxy power spectrum, P g , in bandpowers Pj. From a weak lensing 
measurement of the convergence 



where W is the lensing kernel, it is possible to construct a cross power spectrum with 
galaxies, C Kg (£) ~ J2 a fa(£)Pa , where P a ' is the band power estimate of Pv 2 (*-*)g- 
These band powers can all be collected into one estimator, 

- _ C Kg (£,A£) 
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where a band averaging over bins of width Al has been assumed. It can be shown that 
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The diagnostic Eq will take different values depending on the theory of gravity: 
E G = n /(3 in ACDM and DGP (with different fl Q in either case), and E G = fi /(l+/ fi )/3 
in F(R). In TeVeS Eq is significantly different from the ACDM value, and is scale 
dependent. Hence, Eq is in principle a good diagnostic of the underlying gravitational 
theory. 

In |1060j . the authors attempted to extract an estimate of Eq from a sample of 70,205 
luminous red galaxies (LRGs) from the SDSS. Their estimate was concentrated at a mean 
redshift of z ~ 0.32 and they found 8 estimates of Eq across a range of between 2h^ 1 to 
50ft,- 1 Mpc, with a mean of E G = 0.40 ± 0.07. This should be compared with the ACDM 
prediction of E G = 0.408 ± 0.029, the F(R) prediction of E G = 0.328 - 0.365, and an 
approximate TeVeS prediction of E G ~ 0.22. The authors of [1060j have argued that this 
is evidence for the validity of General Relativity on cosmological scales. The results from 
|1060j are somewhat tentative and preliminary, but nonetheless promising. A judicious 
choice of cosmological parameters may indeed be able to tease out the particularities of 
how $ and $ evolve in different theories of gravity. With planned mega surveys such 
LAMOST, LSST and SKA it is likely that very tight constraints on Eq will be achievable 
in the future. 



6. 3. Forecasting Constraints from Future Surveys 

There is still quite a way to go in terms of constraining deviations from General 
Relativity. As mentioned in the previous section, with the planned mega surveys it 
should be possible to reconstruct the time evolution of /i and C with ever increasing 
precision [4311 432 . With the projected quality of the data, it also makes sense to 
consider model independent parameterisations of /i and £ using some form of Principal 
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Component Analysis (PCA). The authors in |1304] have done just this, and attempted to 
quantify how many eigen-modes in an expansion of fi and £ can be accurately constrained 
with surveys such as Planck, DES, LSST, SNAP and JDEM. They find that while with 
DES it should be possible to constrain up to 20 independent eigen-modes, LSST will 
improve this by a factor of 5, allowing detailed reconstruction of the time evolution in 
modified gravity. They find that in the best case scenario LSST will accurately constrain 
the time and scale dependence of /x and £ in the range 0.5 < z < 2 and 0.04 < fc0.16 h _1 
Mpc. 
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7. Discussion 

In this review we have discussed recent advances in gravitational physics, many of 
which have been driven by the discovery of the dark side of the Universe in the late 1990s. 
In this final section let us now briefly consider the outlook for future gravity research, 
and, in particular, what we consider to be some of the upcoming highlights. 

From a theoretical perspective, model building is an important part of understanding 
and explaining existing data, as well as making predictions for the future. Tight con- 
straints already available on solar system and astrophysical scales, however, mean that 
model builders are presented with a choice: They can either study minimal deviations 
away from General Relativity, or must otherwise look for mechanisms that hide modifi- 
cations to gravity on the scales probed by experiment. The former of these has value for 
understanding the special nature of General Relativity, and the consequences of moving 
away from it, while the latter provides an exciting opportunity to try and solve some of 
the cosmological puzzles that have arisen with the discovery of dark matter and energy. 

Modified gravity necessarily involves additional fields, extra dimensions, or broken 
symmetries, since we know that GR is the unique diffcomorphism invariant theory of a 
single rank-2 tensor that can be constructed from the metric variation of an action in 
four dimensions. An important consideration is then whether or not these deviations 
manifest themselves at the level of the background cosmology, or merely at the level of 
perturbations. Of course, if we wish to account for dark energy, or solve the cosmo- 
logical constant problem using modified gravity, these deviations must be manifest in 
the solutions of the Friedmann equations. We must also require, however, that they do 
not spoil the successful predictions of the standard cosmology such as the abundance 
of light elements, the peak positions of the CMB acoustic spectrum, or the predictions 
for baryon acoustic oscillations. This requires the background FLRW cosmology of the 
modified theories to closely mimic the standard evolution of ACDM from nucleosynthesis 
through to matter domination. 

Let us now consider the assumptions that go into the standard cosmology. These 
include 

• Einstein's field equations. 

• The universality of free fall. 

• Local Lorentz invariance. 

• Three spatial dimensions (below the electro- weak scale, at least). 

• Conservation of energy-momentum. 

• Homogeneity and isotropy of space on large scales. 

• Matter fields being well modelled by fluids of dust and radiation. 

These assumptions, when confronted with observations, require the presence of dark 
matter and dark energy. From a particle physics perspective this is very discomforting, as 
we have discussed in Section [2.6.6( as it requires the vacuum energy to be at least 60 orders 
of magnitude smaller than expected from particle theory. This problem is exacerbated 
by the fact that contributions from the zero-point energy of massive particles changes as 
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the universe cools and goes through phase transitions. To circumvent the difficulties that 
this entails, many model builders therefore assume some unknown symmetry that sets 
the vacuum energy, plus any contribution from a bare cosmological constant, to zero. 
To account for cosmic acceleration they then require modifications to gravity on large 
scaleJ^l 

In some respects addressing the puzzles in cosmology is the easy part of the model 
builders job. More difficult is to do this in a way that is consistent with the well estab- 
lished observations of relativistic gravitational phenomena. This often involves identify- 
ing a screening mechanism that can work on solar system scales. The point here is that 
in order to modify the cosmological evolution we have to deviate considerably from GR 
in the IR, but IR modifications of gravity are known to cause strong deviations from the 
predictions of General Relativity for experiments that involve, for example, the bending 
or time delay of light. Deviations on the scales that these observations are made must 
therefore be suppressed. Currently the most popular examples of this type of mechanism 
are the Vainshtein mechanism and the chameleon mechanism, both of which we have 
discussed in this review, and neither of these is without its problems. It is therefore 
important for theorists to identify new ways to screen the deviations from GR at short 
distances, and we expect this to be an important avenue of future research. 

It is also crucial to ensure that our models obey some basic requirements of stability. 
There is little motivation for carrying out complicated simulations, and devising expen- 
sive experiments, to probe theories that are fundamentally sick. For example, we may 
choose to ask if the theory contains ghosts, which can lead to catastrophic instabilities 
unless the mass of the ghost lies above the effective theory cut-off, as described in Section 
|2.1.3| Ghosts are rife in modified theories of gravity, and dealing with them is one of 
the principle challenges faced by model builders. Another important thing to establish 
is the cut-off for our effective classical theory. Modified theories of gravity can become 
strongly coupled at lower than expected energies, and enter a quantum fog. If we want 
an effective theory description of gravitational fields at the surface of the earth to be 
classical, we need to impose that the cut-off is at least of the order of a few meV, since 
classical gravity has been tested in the lab at this scale. Conservatively, one may also 
wish to impose that the classical description of gravitational fields around the Sun should 
be trustworthy, at least down to its Schwarzschild radius. 

Let us now consider the prospect of future input into this field from experiments and 
observations. Tests of Lorentz invariance and the weak equivalence principle, that are 
already well established on Earth, are now being proposed and planned as space mis- 
sions. Space offers a number of benefits over Earth-based experiments, including a lack 
of seismic noise, and the fact that cooled atoms stay in interferometers longer. There are 
also theoretical reasons for wanting to test these foundational issues of gravity in space, 
such as the proposal by Khoury and Weltman that extra gravitational degrees of free- 
dom could have an environmental dependence. Space based tests of Lorentz invariance 
offer the possibility of improving constraints on violations of this symmetry by orders of 
magnitudes (see, e.g. |172U869j l. Space based tests of the equivalence are being planned 
by the French space agency CNES, under the name MICROSCOPE (MICRO-Satellite 
a trainee Compensee pour l'Observation du Principe d'Equivalence), and by ESA and 



74 Some approaches, such as degravitation, (see Section [5.6.2[ l even use modifications to gravity in order 
to screen the effects of the iarge vacuum energy, thereby removing the need for the unknown symmetry. 
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NASA, under the name STEP (Satellite Test of the Equivalence Principle). These two 
missions promise to increase bounds on violations of the weak equivalence principle to 
the level of 1 part in 10 15 , and 1 part in 10 18 , respectively. 

With regards to space based tests of metric theories of gravity, there is also hope 
for further improvement on current bounds. The Bepi-Columbo Mercury orbiter being 
planned by ESA will, after a two year mission, be capable of placing constraints on PPN 
parameters of the order 7 - 1 - 3 x 1(T 5 , /? - 1 ~ 3 X 1(T 4 , and a x ~ 1CT 5 [SM]. The 
bounds on 7 could be improved further by Gaia, a high precision space telescope that 
could constrain 7 — 1 to around 1 part in 10 6 . Still greater constraints may be possible 
with LATOR (Laser Astronomic Test of Relativity) 1230 . This mission consists of 
two satellites that orbit the Sun at 1AU, and will have the potential of being able to 
constrain 7 — 1 to around 1 part in 10 8 , and solar frame-dragging to the level of ~ 1%. 
Such constraints are orders of magnitude greater than those that are currently available. 

Lunar laser ranging has played an important part in testing gravity over the past 
4 decades, since retroreflectors were placed on the moon by Apollo astronauts, and 
the Soviet Lunokhod rovers |885) . Improvements in ground based technology during 
this time have improved the bounds on PPN parameters that reflected lasers have been 
able to impose, but we are now reaching the stage where further gains will be limited 
by the retroreflectors themselves. Tightening the current bounds using this technology 
will therefore require new space missions, and, in particular, the possibility of planting 
retroreflectors on other planets has great potential. Laser ranging of Mars over a 10 
year period would allow 7 — 1 to be constrained to around 1 part in 10 6 |1231j , and the 
Nordtvedt parameter 77 = 4/3 — 7 — 3 to the level of ~ 6-2 x 10 -6 [J7]. Again, these are 
order of magnitude improvements on current bounds. 

Moving beyond the solar system, binary pulsars are an excellent test of relativistic 
gravity, and offer the possibility of becoming more constraining than solar system test in 
the near future |1177j . Pulsars also offer the opportunity to test gravity through the emis- 
sion of gravitational waves. In particular, pulsar- white dwarf systems have great potential 
to constrain the emission of dipolar gravitational, which is a generic prediction of a large 
number of modified theories of gravity. Binary systems PSR J1141-6545, J0751+1807 
and J1757-5322 are all recently discovered pulsar-white dwarf systems. What is more, 
continued observation of existing pulsars also offer the possibility of new tests of gravity, 
as, for example, the perihelion of PSR B1913+16 precesses it may soon allow for tests 
of the Shapiro time-delay effect. Future prospects for testing gravitational physics using 
pulsars are also bright due to the large numbers of these objects that are expected to be 
found by the Parkes, Arecibo, and Green Bank telescopes, as well, of course, as the SKA 
(Square Kilometre Array). The chance of detecting a pulsar-black hole systems increases 
dramatically with large-scale observations of this kind. Such a system would be poten- 
tially of great importance for testing strong field gravity. Finally, the double pulsar PSR 
J0737-3039A,B also offers a unique test of gravitational physics, with excellent prospects 
for improving constraints on gravity in the future as observations of it continue. 

In all of these areas it is likely that the bounds on deviations from General Relativity 
will continue to be tightened, with lab tests too promising continued improvement. The 
Eot-Wash group at the University of Washington, and others, continue to increase bounds 
at ever smaller scales, and even particle experiments using the LHC at CERN are looking 
for the signs of the extra-dimensions that are crucial for so many modern theories of 
gravity. The future prospects for constraining all of these aspects of gravity means that 
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the extra degrees of freedom in modified theories of gravity will have ever smaller regions 
of parameter space in which to hide. This also, of course, means that the possibility 
of making a detection of a deviation from General Relativity is improved, if any such 
deviations really do exist in nature. 

It is, of course, the case that there have been tremendous developments in observa- 
tional cosmology over the past couple of decades, and these observations have put gravity 
in the spotlight once again. With measurements of the CMB, weak lensing, and galaxy 
surveys, as well as probes of the expansion rate with distant supernovae Ias, a strange 
Universe has been uncovered in which more than 95% of the energy budget is in some 
exotic dark form. The quality of these observations are such that it is difficult to avoid 
such a conclusion if the gravitational force arises from Einstein's theory of gravity. An 
alternative point of view is that these observations are pointing at a flaw in our un- 
derstanding of the behaviour of the Universe on the largest scales, and of behaviour of 
gravity at these distances in particular. Indeed, these observations may be a sign that 
we must think beyond Einstein's General Theory of Relativity, and his field equations. 

In the same way that cosmological observations may be hinting at new physics in the 
gravitational sector, they can also be used to constrain and even rule out alternatives. 
With a vast range of experiments being planned and constructed throughout the world, 
it seems that we are a critical juncture in the path to understanding gravity. At ESA, the 
satellite mission Euclid is under assessment. This mission could map out vast regions 
of space, probing the growth rate and morphology of large-scale structure, both key 
observables for constraining theories of gravity through their effects on gravitational 
collapse. The SKA is in a planning phase with path finders being constructed on two 
continents. SKA will be a vast radio telescope that will generate a survey of up to a 
billion radio galaxies, mapping out the evolution of structure back to extremely high 
redshifts. These observatories will also be competing against the Large Synoptic Survey 
Telescope (LSST) that will image the sky over a period of ten years, building up a survey 
of galaxies that will primarily be used for weak lensing. 

These larger experiments will be complemented by smaller, more rapid surveys such 
FastSound, Weave, Boss, Viper, KIDS, DES and a host of other collaborations that will 
produce galaxy and weak lensing surveys on smaller scales, targeting different redshifts. 
Interestingly enough, while the original scientific driver for many of these surveys was 
to constrain dark energy, they have all taken on board the need to test gravity. Indeed 
measuring the growth rate of gravitational collapse, as a test of modified gravity, has 
become the core business in all of these surveys. 

Einstein developed his General Theory of Relativity almost a century ago, and, al- 
though it remains a cornerstone of modern physics, one could argue that of all the 
fundamental forces of nature it is gravity that remains the least well understood. This 
is almost certainly due to the weakness of the gravitational interaction, which makes 
it incredibly difficult to test in the lab experimentally. Inevitably, experiments on the 
scale of planets, stars, galaxies, and beyond cannot be performed with the same level of 
precision and control as those conducted for the other forces on Earth. Never the less, 
technology is now starting to catch up with gravity. The latter half of the twentieth 
century may have belonged to the Standard Model of particle physics, but there is every 
reason to suspect that the twenty first century will belong to gravity. 
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